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ABSTRACT 


In this thesis, a generic genetic algorithm (GA) is presented that is implemented 
on a reconfigurable computer. Our GA is implemented such that many problems can be 
solved by simply adapting the problem to the GA. For example, part of this process 
involves the customization of the fitness function of the given problem to the GA. The 
size of the problem is limited by the capacity of a field programmable gate array that is 
part of the reconfigurable computer. We apply this to bent functions, which are Boolean 
functions that are well suited for cryptographical applications and are extremely rare. 
Experimental results show the effectiveness of this technique. Different methods are 
used to discover bent functions. These methods take advantage of the properties of bent 
functions to reduce the total search space. This allows a brute force search to be 
conducted on the reduced search space to locate the set of bent functions in that search 
space. Two different methods are used to reduce the search space. The first is through 
rotationally symmetric functions, which reduces the number of bent function that can be 


found, while the second is by the degree of the function, which locates all bent functions. 
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XIV 


EXECUTIVE SUMMARY 


In this thesis, several methods are shown to locate bent functions. Bent functions 
are well suited for cryptographical applications, such as in the substitution box in the 
DES encryption standard, or the Grain-128 cipher [7]. To the best of our knowledge, this 
is the first time that a reconfigurable computer has been used to locate bent functions on 
more than six variables. Due to the repetitive nature of an algorithm needed to determine 
if a function is bent, reconfigurable computers are ideally suited to locate them, 


especially when compared to a general purpose computer. 


As a result of this thesis, the Naval Postgraduate School (NPS) now has 
5,425,430,528 6-variable bent functions, and 1,933,312 8-variable ROTS bent functions 
for use in additional thesis and research work. Additionally, calculators have been 
created to allow a nonlinearity calculation to be made on 8- and 10-variable functions. 
This was not previously possible at NPS due to a lack of memory error that occurs when 
attempting to complete the nonlinearity calculation using a previous instantiation of the 


algorithm. 


Multiple ways were examined to locate bent functions. Since there is not enough 
time to conduct a brute force search of all functions on more than four variables, three 
different means were used to restrict the search space to locate bent functions. The first 
was by examining rotationally symmetric (ROTS) functions. The second method was to 
search for them according to the degree of the function as revealed by its algebraic 
normal form. This is accomplished through sequentially enumerating all of the functions 
according to their degree through an index. This method revealed an interesting fact that 
bent functions commonly occur with consecutive indices. Next, a genetic algorithm was 
used to create a sieve to locate ROTS bent functions. These results show that, through a 
well-designed chromosome, fitness function, crossover point and minimum fitness value, 
the ability to locate bent functions is drastically increased. Finally, a non-traditional 
approach towards GAs is taken to identify bent functions on 4 variables using other 4 


variable bent functions. 
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I. INTRODUCTION 


A. CRYPTOLOGY APPLICATIONS 


Two of the key aspects of a cryptographic system are confusion and diffusion 
[14], [10]. Diffusion is the process by which repetitive information is “dissipated” over 
an entire message. A simple case of confusion would be to replace one letter, “E’, with 
another letter, “K’. This method does little to improve the secrecy of the message since 
the frequency graph of the letters has not changed. In order to make this method more 


practical, several substitutions must be made [14]. 


In a paper frequently described as “small” and “beautiful,” Rothaus introduces a 
new type of function known as a bent function [5], [6], [13]. The term “bent” was 
probably chosen by Rothaus because it suggests the opposite of “linear” [3]. These 
functions are most notable because they have the highest nonlinearity among all functions 
on the same number of variables. Because of this, they are well suited for 
cryptographical applications, including being used as part of the substitution box in the 
DES encryption, or the Grain-128 cipher, to mention just a few [7]. 


B. GENETIC ALGORITHMS (GA) 


Several problems that need to be solved in a business environment involve a cost- 
benefit analysis. As various scenarios stress the variables differently, desirable 
components might exist in different solutions. GAs take advantage of two solutions 
yielding good answers by merging the parameters of the two solutions. Although the 
groundwork on GA started in the 1950s, the interaction between a conventional 
microprocessor and a reconfigurable computer is allowing GAs to enter a new realm of 
truly parallel operations. Although a GA has been implemented on an FPGA, this thesis 


will examine their implementation in a more parallel manner [19]. 


C. RECONFIGURABLE COMPUTING ON THE SRC-6 


The SRC-6 is a microprocessor based computer that contains additional boards 
known as Multi-Adaptive Processing (MAPs). Each MAP contains three Xilinx Field 
Programmable Gate Arrays (FPGAs), two running user programs, and the third 
controlling the others [16]. To understand how this project helps to locate bent functions, 
it is necessary to know how the SRC-6 operates. The SRC-6 operates via a dual core 
general purpose CPU communicating with the FPGAs. There are two methods in which 
the FPGAs can be programmed to solve a problem. The first is through a hardware 
description language (HDL), either Verilog or VHDL, and the other is through the high 
level languages C or FORTRAN. The remainder of this paper will focus on 
programming the SRC-6 with Verilog and C. Regardless of whether the FPGA is 
programmed in C or HDL, where the problem solution design is written, the interface 
between the microprocessors and the FPGA is through C code. Figure | demonstrates 


the interaction between the various components of the SRC-6. 
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main.c / 


; subr.mc macro.v 
main.cc 
= blik.v 


Figure 1 SRC-6 data flow path 












































The programmer is able to access the FPGAs on the SRC-6 by first writing 
C/C++ code that is executed on the general purpose computer. When he wants to have 
the FPGAs process data, he merely makes a C style function call that invokes the SRC-6. 
Next, program execution is passed to a subroutine that is written in C. This subroutine, 


subr.mc, is compiled to operate on the FPGA. 


When a problem is solved through Verilog, the programmer designs the circuit as 
he would for any FPGA. The file describing this circuit is macro.v. This module has two 
interfaces to the subr.mc C code. The first is a blackbox interface (blk.v) that would be 


analogous to a function prototype in C. This interface merely restates the module name 


2 


and port declarations from the module source code. The other interface (info) includes 
information about the macro, such as if it is pipelined or stateful. It also specifies its 
latency and additional control signals that need to be applied to the circuit, such as clear 
and clock signals. This code is directly parsed to generate the interface between the C 


code and the Verilog code. 


D. GOAL OF THIS THESIS 


The goal of this thesis is to determine if GAs are useful in finding bent functions. 
In the process of doing this, several different methods of looking for bent functions will 
be examined. These methods restrict the search space to make it possible to enumerate 
all of the bent functions. In doing so, it is desired to see how these methods can be 


adapted for use in GAs. 


E. THESIS ORGANIZATION 


This thesis is organized as follows. Chapter II describes bent functions and how 
to find them. Chapter III is a discussion on genetic algorithms, and how one is 
implemented on the SRC-6. Chapter IV is a discussion on how bent functions were 
found for this thesis. Chapter V is a summary of the results. Appendix A contains 
additional information on the SRC-6. Appendix B are the lessons learned while 
conducting the research for this thesis. Finally, Appendix C is the Verilog code for the 


genetic algorithm. 
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I. BENT FUNCTIONS 


A. BACKGROUND 


Bent functions contain many properties that must be examined. These properties 
are important since they are used to determine if a function is bent, and provide insight in 


how to construct a function that might be bent. 


1. Definitions 


a. Linear or Affine Function 


A linear function is the constant 0, or the exclusive OR of one or more 


variables. An affine function is a linear function, or the complement of a linear function. 
b. Nonlinearity (NL) 


The nonlinearity of a function is the least number of bits that are required 


to be changed in order to convert the function into some affine function. 
Cc. Bent Function 


A function on an even number of variables is called bent, if it has the 


maximum nonlinearity among all other functions on the same number of variables. 
d. A-class 


Suppose f is a bent function and a is an affine function. It has been 
shown that g = f ®a, is also bent [5]. Because of this, we say that f and g are in the 


same A-class. 


e. Truth Table (TT) 


The truth table of a function f, specifies the value of f for all 





assignments of values to the variables. For example: the TT of f, =%,x,+%,x,+4,X, is 


1110, where x,x, = 00, 01, 10 and 11 map to 1, 1, 1, and 0, respectively. 

f Algebraic Normal Form (ANF) 

The ANF of a function f is the exclusive OR of product terms, where all 
variables occur uncomplemented. For example, consider the ANF of f, =1® x,x,. 

Q. Degree 


The degree of a function is the maximum number of variables that exist in 
any of its terms as expressed in the ANF. For example, let f and g be functions on 3 
variables, x,, x, and x,, f=x,.x,®x, and g=x,x,x,®1. Then, f and g have degree 


2 and 3, respectively. 
h. Co-functions 


The TT form of a function f can be considered as a double word 
So-sx, fiss, Which are f with x, replaced by 0 and | respectively. Each of the words that 


comprise the TT form of a function is called a co-function. The co-function containing 
the MSB is referred to as the “high” co-function, and the other co-function is referred to 


as “low”. 


2. Properties 


a. Rotationally Symmetric (ROTS) Functions 


A function is ROTS if f(a),q,...4,5,4,,)=f(G,a,...,d,;,4)). The 


number of bits in an n-variable ROTS function is R [5]. 


The 


Maximum Nonlinearity 


maximum 


NL, _(n)=2"-2?” [13]. 


max 


Cc. 


Weight 


nonlinearity 


of an 


n-variable function 


The weight W, of a function f is the number of ones in its truth table. A 


bent function f has weight W, (n)=2"'+ 22 [13]. 


d. 


Summary 


Table 1 provides a listing of the properties on 4, 6, 8 and 10 variables. 









































n 2" R a NL vax (1) W, 
4 16 6 32 6 8+2 
6 64 14 128 28 32+4 
8 256 36 512 120 128+8 
10 1024 108 2048 496 512+16 
Table 1. 


B. REPRESENTATIONS 


Bent functions properties by number of variables 


1. Truth Table 


Consider Table 2 for the expression x, on three variables. It has a truth table 


representation of 0x33. The MSB corresponds to the table entry of, x,x,x, =111, while 


the least significant bit (LSB) entry corresponds to entry x,x,x, =000. The hexadecimal 


representation of 00110011, =0x33 has the MSB (Most Significant Bit) being written as 


the leftmost bit. 


























poe a xX, or x, Ol 
000 1 (LSB) 
001 1 
010 0 
O11 0 
100 1 
101 1 
110 0 
111 0 (MSB) 














Table 2. Truth table 


Zs Algebraic Normal Form 


A function is expressed in its algebraic normal form (ANF) by: 
f =c1Dec,x, Bc, x, Be,xX,xX, De,x, BCX, X; DB CX, X, BCA XyXq 


Where c,,C,,...,c, are the values from the ANF table where c,corresponds to the MSB. 
Consider the expression x,x,®x, in Table 3. When it is represented in its ANF, its 
expression is 01000010, =0x42. Again, the left most bit in the hexadecimal notation is 


the MSB. However, this time the entry 1 or 0 in the truth table corresponds to whether 
that term exists or not in the expression. For example, the expression x,x, ®x, has two 
terms, x,x, and x,. Thus, the entries 110 and 001 have a one in their associated column 
to signify that the term exists in the expression. The ANF is a useful representation, 
because it can be used to identify bent functions of the same A-class. If two bent 


functions are of the same A-class, then all of their non-linear term coefficients will be the 


same. 



































Term Hi Xst, X,X, Ox, or 
op, Sane Cte ep a, 
roa 000 0 (LSB) 
CH 001 1 
C4, 010 0 
C3X,X, O11 0 
ee 100 0 
C5X,X3 101 0 
CaS: 110 1 
Cua 111 0 (MSB) 
Table 3. ANF table 
3. Transeunt Triangle 


The transeunt triangle is a data structure that allows the conversion from the truth 
table form to the ANF, and vice versa [18], [2]. Regardless of the mode of operation, the 
transeunt triangle receives and processes its data in the same manner. The data for the 
current format is placed along the bottom row of the triangle, with the MSB being the 
right most bit. The bits on the next higher row are created by the exclusive ORing of the 
adjacent bits in the row below it. The ordering of these bits corresponds to the truth or 
ANF table shown in Table 3 being rotated counter-clockwise90°. The output table is 
read along the left side of the triangle. Its corresponding values would be as if the above 


tables are rotated 120° counter-clockwise. Figures 2, 3 and 4 show the transeunt triangles 


for Tables 2 and 3. 







































































On 4 
LXLKY5 
Ox OX4 
Z 0/10 
LKO 
xX s¥ 1010/0 
a ae 0)010)]0 
PL OKO o[olololo 
aL OKO 1/14;)14/)1)1~/) 41 
LX O 0o;1/;0;1;,0/;/1)0 
“4 fA OOF A as soe le6 
x<lOolr/olrlo/rlol|— 
x<lOolo/rl|rlolo/r/— 
xlOololololrlri/-i[- 




















LSB MSB 


Figure 2. Transeunt triangle conversion for x, to x, ®1 
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Figure 3. Transeunt triangle for x,x, ®x, to Xx, +%,X, +X,x,x, 
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Figure 4 Transeunt triangle for x, ®1 to x, 


The transeunt triangle shown in Figure 4 is the inverse of the Figure 2 . On initial 





observation the truth table expression would be %,X,xX, + %,X,xX,+%,X,%,+%,%X,x,. This, 
however, can be easily simplified as in Figure 5. 
XN Xe PM Xo y FAH; H, HHH, = 
XX (Hy + Xz) + X,X,(%, + x,) = 
XX + XX, = 


X,(%,+%,) = 


x4 


Figure 5 Truth table simplification 


C. BENT FUNCTION DISCOVERY 


There are several different approaches that can be used to discover bent functions. 
Regardless of the tool used to restrict the search space to a manageable size, this research 


uses the same algorithm to determine if a function is bent. 
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1. General Case 
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Figure 6 General case nonlinearity calculation 


Figure 6 shows a general algorithm to determine the nonlinearity of a function. 
The function under test is exclusive ORed with each of the affine functions, and the 
number of ones in each function is counted. The minimum number of ones is then 
determined over all of the 2”*' calculations which correspond to each of the affine 
functions. The resulting number is then compared to the maximum nonlinearity for the 


given number of variables. If they are the same, then the function is bent. 
2. Brute Force 


The easiest method to determine which functions are bent is to perform a brute 
force attack. By doing so, the nonlinearity of all functions in S is calculated. By doing 
so, one also generated a histogram showing the distribution of the nonlinearities. This is 


beneficial in that it also verifies the maximum nonlinearity equation, NL_.(n). This 


max 


method is time consuming for n < 6 and impractical for n26. 
3. ROTS 


It has shown that rotationally symmetric functions are rich in bent functions [5]. 
This means that by enumerating only the ROTS functions, which is considerably smaller 


than the total number of functions, bent functions can be more readily discovered. The 


2 


first step in this process is to determine all of the ROTS functions. Next, each bit is 
mapped to an index. Table 4 is the Verilog code that produces the necessary mapping for 


n=4. 







































































assign 0] = RSI[ 0]; 
assign 1] = RSI[ 1]; 
assign 2] = RSI[ 1]; 
assign 3] = RSI[ 2]; 
assign 4] = RSI[ 1]; 
assign 5] = RSI[ 3]; 
assign 6] = RSI[ 2]; 
assign 7] = RSI[ 4]; 
assign 8] = RSI[ 1]; 
assign 9] = RSI[ 2]; 
assign 10] = RSI[ 3]; 
assign 11] = RSI[ 4]; 
assign 12] = RSI[ 2]; 
assign 13] = RSI[ 4]; 
assign 14] = RSI[ 4]; 
assign 15] = RSI[ 5]; 


Table 4. | ROTS mapping 


From this table, a 6-bit counter can be applied to the rotationally symmetric index 
(RSI) to generate the truth table representation of the ROTS functions. This 
representation generates the function under test, which is applied to the general case 


algorithm to determine the nonlinearity as shown in Figure 7. 
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Figure 7 ROTTS nonlinearity calculation 
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4. By Degree 


It is known that the highest degree of a bent function is ; [16]. Furthermore, in 


the ANF of a function, the linear terms can be ignored, since they only differentiate two 


bent functions in the same A-class. This, significantly reduces the total search space to 
terms of degree 2, 3, ..., and 5 For the case of n=6, the highest degree of a bent 


function is 3. This means that the only functions that must be enumerated are those of 


degree 3 and 2. Thus, the total number of bits in the search space is now 


6 6 
a (5) =15+20=35. This results in only 1.86x10°'% of the original search space of 


2™ functions, if the search is exhaustive. Figure 8 shows the original algorithm as 


modified to accomplish this. 
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Figure 8 Nonlinearity calculation by degree 


=i Complement Optimization 


The number of affine functions that need to be evaluated can be cut by 2. This is 
done by recognizing the relationship between nonlinearity of a function with respect to a 
particular affine function, and that affine function’s complement, and will be shown by 
example. For any given n variable bent function, there are 2”"' affine functions. 
However, because of the linear nature of the affine function, it is only required to 
enumerate 2” of them since their complement will yield the other 2” affine functions. 


This property relationship can be applied to the nonlinearity of a function. 
14 


Consider the case when n=4, and the affine functions Ox0000and its 
complement OxFFFF . Consider a function f =OxC3D7. Its nonlinearity with respect 
to0x0000 is 10, and its nonlinearity with respect to OxFFFF is 6. Recalling that there 
are 16 bits in each function on four variables, and given the affine function 0x0000, we 
can determine the nonlinearity of its complement affine function, OxFFFF, by 
subtracting its nonlinearity from 16. For this case, 16—10=6. In general, the below 


formula can be utilized to determine the minimum nonlinearity, NL,,,, of a function 


‘min ? 


given the NL one of the affine functions. 
NL pin = min(NL, 2”*' — NL) 


Through this calculation, the following effects are observed on the circuitry 
required to implement the algorithm. The number of exclusive OR gates required is 
reduced by %, along with the number of ones count calculations. However, the number 
of minimization calculations required is constant, because an additional minimization is 
implemented with the aforementioned subtraction. 2” subtraction units must be added to 


the NL... calculation. However, the complexity of that operation is insignificant 


compared to the required circuitry to implement the ones count algorithm for the affine 


functions not directly tested. 
D. SUMMARY 


This chapter describes bent functions, and introduces their various properties. It 
is through application of these properties that locating bent functions is possible, given 
their rarity. The next chapter discusses genetic algorithms in preparation for how to 


locate bent functions. 
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HI. GENETIC ALGORITHMS 


A. BACKGROUND 


Genetic algorithms have their basis by what is seen in nature. The three main 
processes that will be discussed are survival of the fittest, crossover and mutation. 
Survival of the fittest is similar to the idea that the stronger animals in a herd are more 
likely to live and go on to produce children for the next generation. Crossover is based 
on the possibility that if two parents have desirable traits, their children may have a 
combination of those desirable traits. Finally, mutation is the idea that a change in a gene 


might make the animal more resilient for its environment. 


1. Definitions 


a. Chromosome, Element or Member 


A potential solution to a problem is encoded as a string and is referred to 
as the chromosome. Strings can be any combination of characters, but this thesis will 
only consider the case of binary digits. A chromosome may also be referred to as an 


element. 
b. Gene 
In this thesis, a gene is each character of the chromosome. 
c. Value 


The value is a numerical representation of the chromosome. This can be 
created in any number of methods. One common method is to directly convert the 
chromosome into its integer representation. In a genetic algorithm involving 
trigonometric functions, for example, the value could represent some fraction between 


—7 and z. 
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d. Fitness Function 


The fitness function is a function that converts the value into a fitness 


value. 
e. Fitness Value 


The fitness value is a number that describes how close the chromosome is 
to the optimal solution. The solution could be a maximum, minimum, local maxima or 
local minima, depending on the problem being solved. Identifying a local maxima or 


minima is of interest when utilizing a GA in a cost savings problem. 
f Population 


The population is a group of elements that exist within the genetic 


algorithm. 
g. Generation 


Genetic algorithms operate iteratively. Each iteration is referred to as a 
generation. Generally, the population at the start of a generation is the population at the 
end of the previous generation. For the case of the first generation, the population is 


randomly generated. 
h. Survival of the Fittest 


Survival of the fittest is a process by which chromosomes are selected by 
their fitness value. During this process, some elements are removed from the population. 


This is analogous to nature in that the weaker species die off. 
i. Crossover 


Crossover is the process by which two elements that were selected during 
survival of the fittest combine to produce two new elements. The combination occurs by 


randomly picking a gene position. All of the genes to the left of this gene in chromosome 
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a are combined with all of the genes to the right of the same position in chromosome b 
to create a'. This process repeats to create b' with the unused portions of a and b. 
This is analogous to nature in that two parents with desirable characteristics might go on 


to produce children that also have desirable characteristics. 


J. Selection 


There are several ways to determine which of the strings are selected for 
crossover. One method is to only take the elements with the best fitness values. 
Unfortunately this has the effect of removing some chromosomes from the solution pool 
that might actually be needed to arrive at the optimal solution. This can be countered 
through the use of the “roulette wheel” algorithm which assures that all elements of the 


population have at least a small chance of being selected. 


k. Mutation 


Mutation is the process by which the genes of a single element may be 
changed by a random process. This is analogous in nature to an event causing a gene to 
change in an animal thus making it more suited for survival. An example in the 


following section demonstrates the need to implement mutation. 


L String Generation 


In a GA, several different chromosomes exist at one time. This collection 
is referred to as the population. The implementation of the GA used in this thesis has a 
population of 16, based on the number of items that can easily be sorted with the Batcher 
sort. Initially, the population is created through a random string generator. Once this has 
been accomplished, each of the strings is evaluated with the fitness function. The results 
of these calculations are then sorted from the highest fitness value to the lowest. This 
ordering is then used to help determine which of the strings will be selected for crossover. 


As such, this implements the concept of “survival of the fittest’. 
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2. Example of a Genetic Algorithm 


The following discussion on the implementation of genetic algorithms is based on 
GAs taken from [4]. In order to solve a program using GAs, the potential solution must 
first be encoded as a chromosome. Consider a problem of finding the maximum value of 
f(x)=x° for 0< x< 4,095. Assume x is realized as a 12-bit binary number. Its value 
is directly derived by converting the chromosome into an integer. Its fitness function is 
simply the square of its value. Thus, the higher the fitness function result, the closer to 
the solution you are. Obviously, the best solution to this problem is the string with 12 
ones in it. Figure 9 shows the processes by which the GA operates. Finally, the example 


will not implement mutation, and while doing so demonstrates its need in order to 


achieve the maximum. 
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Figure9 GA algorithm 


The GA starts with the creation of random strings, which represent each member 


in the population. Each of the elements then has their values and fitness values calculated 


as shown in Table 5. 
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Member | Chromosome Value | Fitness value 
1 110101100100 | 3,428 | 11,751,184 
2 010100010111 | 1,303 | 1,697,809 

3 101111101110 | 3,054 | 9,326,916 

4 010100001100 | 1,292 | 1,669,264 

5 011101011101 | 1,885 | 3,553,225 

6 101101001001 | 2,889 | 8,346,321 

7 101011011010 | 2,778 | 7,717,284 

8 010011010101 | 1,237 | 1,530,169 





Table 5. Start of 1* generation, From [4] 


Initially, the population consists of the 8 randomly generated chromosomes. 
Through survival of the fittest, members 1, 3, 6 and 7 are chosen, since they have the 
highest fitness value. Members 1 and 3 are then chosen randomly to crossover at the 


second bit from the left, while 6 and 7 crossover at the 6" bit. These positions are 


marked in the following table with a “/’ as shown in Table 6. 
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Member Chromosome Value | Fitness value 
1 11/0101100100 | 3,428 | 11,751,184 
2 10/ 1111101110 | 3,054 | 9,326,916 

3 101101 /001001 | 2,889 | 8,346,321 

4 101011 /011010 | 2,778 | 7,717,284 

5 111111101110 4,078 | 16,630,084 
6 100101100100 2,404 | 5,779,216 

7 101101011010 2,906 | 8,444,836 

8 101011001001 2,761 | 7,623,121 














Table 6. Start of 2"¢ generation, From [4] 


By applying only survival of the fittest to the start of the 2" generation, the 
following table is constructed. In Table 7, it can be seen that the least significant bit in all 
of the chromosomes is zero. As previously mentioned, the maximum value for this GA is 
a string with all ones. Thus, regardless of where crossover is performed, the least 
significant bit in each chromosome will remain zero preventing the maximum from being 


achieved. In order to prevent this from happening, mutation is necessary. 

















Member | Chromosome Value | Fitness value 
1 110101100100 | 3,428 | 11,751,184 
2 101111101110 | 3,054 | 9,326,916 

3 111111101110 | 4,078 | 16,630,084 
4 101101011010 | 2,906 | 8,444,836 




















Table 7. After survival of the fittest, 2" generation, From [4] 
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a Advanced Operations 


a. Selection Methods 


Previously, when parents were selected for crossover, they were chosen 
only by their fitness value. This prevents good genes that exist in chromosomes with a 
poor fitness value from propagating themselves into later generations. By providing a 
detailed selection method, a means will exist that makes it possible for good genes in bad 


chromosomes to propagate. 


The roulette wheel algorithm is a process by which any of the 
chromosomes may be selected for crossover. It is based on an idea that the chromosomes 
are chosen with a probability that depends on their fitness value. Additionally, it allows 
for chromosomes with poor fitness values to be selected, albeit considerably less 


frequently. 


It begins by determining the sum of all of the fitness values, which are 
assumed to be non-negative. Next, a random number is generated that is between 0 and 
the sum of the fitness values. Next, a running total is initialized to 0, and each member of 
the population has its fitness value added to it. The fitness values of subsequent members 
of the population are added to the total until the running total is equal to or greater than a 
randomly generated number. The last added chromosome is then selected for crossover. 
This process continues until enough chromosomes have been selected to cause the 


population to be filled. 
b. Elitism 


The roulette wheel provides an approach that ensures that any of the 
chromosomes has the opportunity to reproduce and be part of the next generation. The 
side effect of this is that sometimes an ideal solution is removed from the population. 
Elitism is the concept that prevents this from happening. It allows certain solutions 
which meet specified criteria to remain in the population. This process overcomes its not 


being selected for crossover, or mutation changing a gene [4]. 
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c. Selective Crossover 


As previously discussed, one step in crossover is to randomly select a 
point at which to perform crossover. Consider the traveling salesman problem in which a 
salesman needs to travel through a series of cities in the shortest possible trip. Suppose 
that there are six cities that need to be visited, named a, b, c, d, e and f. The chromosome 
is composed of the order in which the cities are visited. Thus, two possible chromosomes 


would be abcdef and debcfa. If these two chromosomes were crossed in between the 
third and fourth genes, the resulting chromosomes are abccfa and debdef. In each of 


these cases, two cities are visited twice during the tour, and neither is a solution to the 
problem. Because of this issue, additional care should be taken when performing 


crossover to ensure that the resulting chromosomes are valid solutions [4]. 
B. IMPLEMENTATION ON THE SRC-6 


The specifics of the problem described in this section deal with a GA that solves a 
packing problem. Constructing a ROTS function, that might be bent, can be viewed as 
this packing problem. The goal of this packing problem is to find a combination of 
objects that weigh a total of 28 pounds. There are four types of items to be packed, 9 
items weighing 6 pounds, 2 at 3 pounds, | at 2 pounds and 2 at 1 pound. This results in a 


chromosome that can be described with 14 binary digits. 


Note that we can divide this problem into two subproblems. The first subproblem 
is to create a subtotal of 4 pounds. This can come from two possibilities. The first is that 
exactly one 3 pound object and one | pound object are chosen. The other possibility is 
that the 2 pound and both 1 pound objects are chosen. The other subproblem is to create 
a subtotal of 24 pounds. There are also two ways to do this—four 6 pound, or three 6 
pound and both 3 pound objects. In the latter case, this prohibits the use of the three 
pound and one pound objects to create 4 pound subtotal. This process describes the 
fitness function. Each subproblem contributes a score of 120 to the fitness value, thus 
resulting in an optimal fitness value of 240. All other combinations of selected objects 


result in a fitness value of less than 240. For example, consider three chromosomes that 
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contain only 6 pound objects. The first, second and third chromosomes contain three, 
four and five 6 pound objects respectively. Since the first and the last chromosomes do 
not have the required number of 6 pound objects, their fitness sub-value would be 90. 
The second chromosome, which contains the correct number of 6 pound objects, would 
thus have a fitness sub-value of 120. This process holds true for all combinations of 


genes, and will not be discussed further. 


Figure 10 shows the data flow path for the GA and the elements described in the 
previous section on GA are implemented. The sorting function is the method used 


utilized to facilitate survival of the fittest. 


Generation 
creation and mutation 
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Figure 10 GA implementation organization 


1. Generation Creation 


Generation creation includes several processes. There are three primary 
functions. The first is the construction of new chromosomes from pseudo-random 
numbers. Second is the calculation of the fitness value for each of the chromosomes. 


Finally, the third process is ensuring genetic diversity. 
a. Generation Creation 


Generation creating is achieved through the circuit in Figure 11. This 


circuit is representative of how each of the 16 different chromosomes is created for the 
ZS 


GA. The number of chromosomes is based on the ability to apply the Batcher sort to 
them. A larger population is possible if the Batcher sorting module is expanded to 
accommodate the population. Furthermore, it shows how the clear unit introduces new 
strings into the population. A linear feedback shift register (LFSR) initializes the 
population with pseudo-random chromosome strings. The LFSRs are initialized through 
a clear signal generated during the first generation. Additionally, for each position in the 
population, the corresponding chromosome from the previous generation has its fitness 
value determined. Once the fitness value is created, it is appended onto its chromosome’s 


associated bit string. 


Previous 
generation 









Compare 
and clear 





Figure 11 Generation creation 


b. Compare and Clear Unit 


The compare and clear unit evaluates the previous generation fitness 
function against a user-specified constant, which is capable of being specified at the 
keyboard. It serves to replace (clear) the chromosome from the previous generation, if it 
does not reach a threshold value, with a new chromosome generated by the LFSR. This 
process helps the implementation of the survival of the fittest concept previously 


discussed. The CLEAR signal that is generated during the first generation is also used by 
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this unit causing string initialization when the GA starts. Finally, it is also capable of 
generating additional clear signals can be generated through the half-life and Order 67 


circuits. 
c. Half-life 


Half-life operates on the idea that, on each generation, the chromosome at 
a particular element position is loaded into successive registers. If those registers contain 
the same value over 3 generations, a clear signal is generated forcing a new chromosome 
into that position in the generation. The “3 generations rule” was arbitrarily chosen to 
allow sufficient time for fit chromosomes to crossover and propagate throughout the 


population. This implementation is shown in Figure 12. 
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Figure 12 Half-life circuit 


d. Order 67 


Order 67 is based on the idea that if adjacent members in the top four 
chromosomes in the population are the same (clones), there is a lack of diversity, and 
thus, a clear signal should to be generated in order to replace an old chromosome by a 
new one. This is of concern due to the small population size. This is implemented 
through a set of simple comparison circuits whose output drives an OR gate that provides 


the CLEAR signal as shown in Figure 13. 
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Figure 13. Order 67 


e. String Generation 


The circuit in Figure 14 is an example of a general purpose LFSR that was 
used in EC4830 from the course notes, and from an exam question. Its primary 
advantage is that it can produce a maximal run sequence provided the correct tap 
positions. The number of different outputs that an LFSR can produce is dependent on its 


tap positions. Since an LFSR requires at least 1 bit to be one at all times during its 


operation, the maximal run of an n bit LFSR is 2” -1. 
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Figure 14 General case LFSR 


The code below shows how the LFSR is parameterized. The parameter n 
determines the number of bits in the shift register. The tap parameters allow for up to 4 


taps, dependant on the particular LFSR being implemented. The tap position came from 
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a predetermined table [17]. The module that instantiates the LFSR module specifies the 
tap parameters. Therefore, tap parameters are not shown in this section of code. When 
implemented, the tap positions were at 13, 4, 2 and 0. Figure 15 is the Verilog code that 


was written to realize the general case LFSR shown in Figure 14. 


module LFSR(CLR, SET, EN, CLK, q): 
parameter n=6; 
//parameter taps=4; 
//Parameter n corresponds to the number of bits in the LFSR 
//Each of the tapX parameters directly corresponds to the maximal tap 
//as shown in Table 3.8 of Dixon 


//In order to properly use the LFSR, the register must first be initialized with the 
//CLR and SET inputs 


//The value in SET is the first value stored in the register 
//The value of CLR must be the NOT of SET 
//SET and CLR must be LOW after initialization for the LFSR to sequence 


parameter tap0=0; 
parameter tap1=0; 
parameter tap2=0; 
parameter tap3=0; 
input [n-1:0] CLR, SET; 
input EN, CLK; 
output [n-1:0] q; 
wire [n-1:0] q; 
wire inwire; 
assign inwire = q[tap0]“q[tap1 ]*q[tap2]*q[tap3]; 
dff_cse ff0(inwire, CLR[0], SET[0], EN, CLK, q[0]); 
genvar k; 
generate 
for (k=1; k<n; k=k+1) 
begin: ea_ff 
dff_cse fi(q{k-1], CLR[k], SET[k], EN, CLK, q[k]); 
end 
endgenerate 
endmodule 


Figure 15 General case LFSR Verilog code 


Figure 16 shows the code that is necessary to instantiate the LFSRs. Since 
the LFSR needs to retain its state from one cycle to the next, it uses the VALID and ITER 
control signals to allow the LFSR to only shift state once per call to the macro. The 
inputs to the macros allow for different random number seeds to be specified from 


main.c. 
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module lfsrs( 
parameter 


parameter 
parameter 
parameter 
parameter 


input [n-1:0] rndo, 


input CLR, VALID, 


rnd0O, rndl5, 

n=6; 

AAA=5; 

BBB=4; 

CCC=2; 

DDD=0; 

rnd15; 
ITER, CLK; 


CLR, VALID, ITER, CLK, w_rng0, w_rngl5); 


output [n-1:0] w_rng0, w_rng15; 
wire [n-1:0] w_rng0, w_rngl15; 


wire [n-l 
wire [n-l 


assign sets[0]=rnd0; 


:0) clears 


:0) sets 


{15:0]; 


(15:30] 7 


assign sets[15])=rnd15; 
assign clears[0]=~sets[0]; 
assign clears[15]=~sets[15]; 


defparam 
defparam 
defparam 
defparam 
defparam 


defparam 
defparam 
defparam 
defparam 
defparam 


endmodule 


rng0.n=6; 
rng0.tap0=AAA; 
rng0.tap1=BBB; 
rng0.tap2=CCC; 
rng0.tap3=DDD; 
LFSR rng0(clears[0]&{6{CLR}}, 


rngl5. 
rng15. 
rng15. 
rngl5. 
rngl5. 


f 


n=6; 


tapO=AAA; 
tap1=BBB; 
tap2=CCC; 
tap3=DDD; 
LFSR rngi5(clears[15]&{6{CLR}}, sets[15]&{6{CLR}}, VALID&ITER, CLK, w_rng15[5:0]); 


sets[0])&{6{CLR}}, VALID&ITER, CLK, w_rng0[5:0]); 


Figure 16 LFSR instantiation code 


Numerical Representation 


As previously mentioned, there are 14 genes in each chromosome for this 


GA. The genes are organized in the chromosome such that genes representing an object 


with the same weight are adjacent to one another. Table 8 shows an example of the 


layout of the chromosome’s genes with respect to its weight. 
























































Bit 13. | 12 }11 |10 |9 8 7 6 5 4 3 2 1 0 
Weight | 6 6 6 6 6 6 6 6 6 3 3 2 1 1 
Table 8. Chromosome format 
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g. Fitness Function 


The fitness function was described briefly in the introduction to this 
section. The flow chart in Figure 17 is representative of its implementation in Verilog. 
The figure refers to lookup tables that will not be discussed in detail. The basis for the 
tables is on how much of each of the fitness sub-problems has been correctly solved. The 
maximum fitness is 240 and the minimum is 0. The maximum is based on each of the 
sub-problems receiving a fitness sub-value of 120. The width of the fitness value is 8 


bits. 
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Figure 17 __ Fitness function flowchart 


2. Sorting 


The need to sort the fitness values in the population is complicated by two factors. 
When the fitness values are sorted, their corresponding chromosomes must also be 
swapped. If this does not happen, the fitness values would lose their meaning. The 
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second factor is based on the nature of operating the GA on an FPGA - that is the need to 
sort the chromosomes in hardware. This is why a parallel sort, such as the Batcher sort, 
was chosen [1]. Furthermore, because it is implemented on specialized hardware, an 


FPGA, it is able to take full advantage of the parallelism of the Batcher sort. 


Figure 18 is taken from [9]. Each horizontal line represents an element in the 
population. The vertical arrows represent a comparison and swap, when required, 
between the two elements. The well-trained eye will notice the symmetry involved in 
this sort. Simply put, sorting 16 elements first requires the sorting of two sets of 8 
elements and then a merging of the two sets. Likewise, sorting 8 elements first requires 
sorting two sets of 4 elements, and so forth. Of particular concern is ensuring that all 
paths through the sorting network have the same pipeline length. This is easily seen on 
element eo. After the two 8 element sets are sorted either eo or eg is the largest. Once 
they are compared, and swapped if required, no further comparisons need to be made 
with eo. Since, after each comparison and swap is made, the resultant values are loaded 
into registers, those elements not compared in a clock cycle must also be loaded into 
registers to ensure all data pipelines are of equal length. There are 4 comparisons in the 


shortest path, and 10 comparisons in the longest path. 
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Figure 18 16 element Batcher sort, From [9] 
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Each arrow in Figure 18 is realized by the sorting element shown in Figure 19. 
As previously mentioned, it is important to maintain the relationship between the 
chromosome and its fitness value when the fitness value is sorted. The comparison 
between the bits representing the fitness values determines whether or not a swap will be 


made. 





bprime[21:0] 


Figure 19 Swapping element for sorting 


3. Crossover and Mutation 


There are two main parts to the crossover and mutation section, namely crossover 
and mutation. There is an additional helper module that provides a scalable means to 


increment the addresses of a ROM. 
a. ROM Address Control 


Due to the use of several ROMs in this GA, a common circuit was created 
to control accessing their elements and is shown in Figure 20. This allows easily scaling 
the GA to allow ROMs containing more words. The first element is an adder, which 
merely adds 1 to the previously used address. The multiplexer is controlled by the 


CLEAR signal that is generated during the first generation. When it is high, the output of 
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the multiplexer is 0, thus providing a means to initialize the address register. As 
previously mentioned, the address register provides the adder with the address to be 


incremented, and which value from the ROM that is to be accessed. 


crossover_mut_rom 


adr[?:] yal[31: 0] 


machine 








adr_3[7:0] 





adr[7: 0] 





Figure 20 ROM address control 


b. Crossover 


As previously mentioned, the need to select members of the population for 
crossover is essential for proper operation of the GA. Although the roulette wheel 
method provides for a good way to select parents, it is difficult to implement in the 
constant time needed for this pipelined problem. Furthermore, the need to generate 
random numbers across a range that is unspecified until all strings have been created is 
difficult, if not impossible, to accomplish in Verilog. In order to combat this, a ROM was 
created that picked which elements would crossover. This was achieved by creating a 
C++ program that generated 32-bit words. These words consist of 8 nibbles, with each 
nibble representing which one of the 16 elements in the population is selected for 
crossover. The selection probability can be changed to whatever is desired for 


experimentation. 


The roulette wheel is based on the desire that each element of the 
population has a chance at being selected for crossover. The sum of all of the fitness 
values is first determined. Next a random number is generated between the 0 and this 


sum. A running total is next initialized to zero. The fitness value of each of the elements 
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is added to the running total until the running total meets or exceeds the random number 
generated. When this happens, the last element added to the running total is selected for 


crossover. 


The criterion for the formula that was implemented was to provide a rough 
approximation of the roulette wheel method, albeit not at the same proportions as 
described by the roulette wheel method. At the output of the sorting circuit, the elements 
are sorted from most fit to least fit. They are then assigned a name, with 0 indicating the 
best fitness value, and 15 the worst. A list was created from which each element is 
selected. The composition of the list is initially 16 0’s, 15 1’s, ..., and 1 15. This 
distribution allows all elements to be selected for crossover, while favoring those 
elements with the best fitness values. Initially, the probability that the best fitness value 
is selected for crossover is 11.7%. Likewise the second best and worst fitness values 
have a probability of 11.0% and 0.7%, respectively. The elements of the list are then 
shuffled and the first element is selected as the first nibble in the word that was being 
stored in the ROM. The list is then parsed removing all copies of the element that was 
just selected. Thus, if the fittest element is selected on the first choice, the second fittest 
element now has a probability of 12.5% of being selected, while the worst fit element has 
a probability of 0.8%. Generally, GA implementations allow an element to be selected 
for crossover multiple times. However, due to the small population size, this 
implementation removes elements selected for crossover to help force genetic diversity. 
This process continues until eight elements are selected. The remainder of the C++ 
program creates the structure that wraps the ROM values with the necessary Verilog code 
to facilitate its instantiation. An example of one word that would be stored in this ROM 


is 32’b0000_0001_0011_0010_1000_0110_0100_1110. When this ROM is read, 





elements 0, 1, 3, 2, 8, 6, 4 and 14 would be selected for crossover. 


All crossover operations are contained within the crossover module. It 
consists of a crossover ROM and 4 crossover units. The ROM provides a control signal 
to the two multiplexers in the crossover units directing which elements will be selected 
for crossover. The crossover units process the selected elements to create the children. A 
crossover module is shown in Figure 21. 
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Figure 21 Crossover module 


Since crossover and mutation destroy the fitness value of the 
chromosomes, the bits representing the fitness value are discarded. Each crossover unit 
consists of 2 16-to-1 multiplexers and a bit swapper module. Each of the multiplexers is 
controlled by one of the nibbles produced by the previously mentioned ROM. The output 


of these multiplexers is then applied to the bit swapping module as shown in Figure 22. 
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Figure 22 Crossover unit 


The bit swapping module consists of fourteen 2 input multiplexers. Each 
multiplexer receives the same control signal, the crossover code; however, the inputs are 
reversed in one of multiplexers as shown in Figure 23. This creates a 14 bit swapping 
circuit where each bit is swapped based on its associated value in the crossover code (the 
control signal for the multiplexers that perform crossover). This is the same basic idea 


that is used in the swapper elements for the sorting circuit. 
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Figure 23 1 bit crossover circuit 


As previously mentioned, care should be taken when choosing the bit 
position where crossover will occur. Since the fitness value is composed of two 
subvalues, the decision was made to perform crossover at the boundary between the 
objects with a weight of 6 pounds and those of 3 pounds. Consider two elements with 
relatively high fitness values. One could meet the criteria of having four 6 pound objects 
selected, and the other chromosome meets the criteria of having a combination of objects 
selected that total 4 pounds. Then, these two elements crossover at the aforementioned 
boundary, one of their children would have the optimal solution. In order to provide 
greater flexibility, and to demonstrate the effects of choosing a crossover point, the 


crossover code is capable of being specified by the user at run time. 


38 


c. Mutation 


Although it was previously shown that mutation is essential to GAs, it is 
nonetheless a rare occurrence. Since mutation is essentially the inversion of a bit, this is 
most easily accomplished through an exclusive OR gate. In this implementation of the 
GA, the frequency of mutation is composed of two factors, how often a mutation might 
take place, and how likely it is that each individual bit will mutate. The probability of 
mutation is usually on the order of 1 bit out of a 1,000; however, like all aspects of the 
GA, each of the parameters is very dependent on the problem [4]. Since this research 
does not expect mutation to play a large role due to the design of the chromosome and 
fitness function, mutation is implemented to demonstrate proof of concept. For example, 
suppose that there is a 1% chance that a mutation will take place during a generation. For 
each time that a mutation takes place, each bit has a certain probability, for example 10%, 
that it will mutate. This means that a mutation might be “scheduled” to take place in 


which no mutation actually occurs. 


A mutation string could be shown to be the 14-bit string that represents 
which bits will be inverted. Because of the difficulties that are involved in making these 
determinations in Verilog, these mutation strings are most easily stored in a ROM. For 
each generation in which mutation does not take place, the 14-bit string is 0. This implies 


that one could have 14x100=1,400 consecutive zeroes before the occurrence of a one. 


This means that the mutation ROM is a good candidate for compression. 


The compression for mutation was decided to be as simple as possible to 
provide at least the proof of concept. It was then decided to create the mutation string as 
follows: 7-bits of zero run length, and 14-bits of mutation code and 4-bits of element 
selection. The seven bit length is based on a given probability that there is a 1% chance 
that mutation will occur during a generation. This means that, over 128 generations, it is 
likely that a mutation will occur. However, since this is not guaranteed, the exception to 


this case will be discussed later. 


Again, the ROM is generated as Verilog code using a C++ program. 


Random numbers are generated with the specification that 1% of the strings will mutate. 
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For each bit, there is a 10% chance that it will mutate, which is represented by encoding a 
one. In the cases in which 128 generations pass without a mutation, the mutation code is 
automatically created as a 14-bit 0 vector. Finally, the 4 bits to select which element of 


the population is to be mutated are randomly created in the C++ program. 


Figure 24 shows the operation of the mutation circuit. It consists of four 
main parts. Starting from the right, a multiplexer that is controlled by how many 
generations have passed since the last mutation. If the GA is ready to mutate, then its 
output is the mutation code and element selection from the ROM; otherwise it is zero. 
Next is the mutation ROM. The second element is an incrementer that controls the 
address lines for the ROM. The first element is another incrementer that counts the 
number of generations since the last mutation. It compares that number with the zero run 
length value stored in the ROM, and generates the control signal for the multiplexer 
controlling the mutation code. The remaining circuitry is provided to control 


initialization and operation of the circuit. 
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Figure 24 Mutation ROM 


C. ADVANCED IMPLEMENTATION ISSUES 


1. Circuit Reutilization 


One of the key elements to the GA is the fitness function. The current 


implementation of the GA includes 16 elements. Each of these elements requires two 
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instances of the fitness function. The first performs the calculation of the fitness value 
from the previous generation, and the fitness for the value from the LFSR. This is made 
possible due to the small size of the fitness function. Unfortunately, if the fitness 
function was chosen as the nonlinearity calculation, thus allowing searching for bent 
functions over all of the search space of n=6, the available resources on the FPGA 
would be quickly exhausted. Table 9 shows the utilizations for two instances and eight 
instances of the n =6 nonlinearity calculation. Of further clarification, no other circuitry 


is involved with these utilizations. 














2 calculations | 8 calculations 
Number of Slice Flip Flops | 12% 33% 
Number of 4 input LUTs 19% 70% 
Number of occupied Slice 28% 94% 
Freq 100.1 MHz 100.0 MHz 

















Table 9. FPGA utilization 


Because of the excessive utilization of the FPGA resources, it is necessary to 
efficiently use the available resources. There are two methods for this. The first is to 
spread the circuitry over several FPGAs, or through multiple executions, reprogramming 
the FPGA during execution. For example, moving 128 bits between the macro and 
subr.mc requires using two 64-bit variables. Unfortunately, the current version of the 
FPGA C compiler does not provide an efficient mechanism to move large amounts of 
data between the Verilog module and the C code on the FPGA. This is problematic, 
since for each 64 bits, or fraction of, one variable must be used to pass the data between 
the module and the C code, and this must be done for all of the 16 elements. Thus, in 
order to only determine the nonlinearity of one function on n=10, 16 variables are 


required to be passed to the macro. 


This implies that it might be better to reuse the circuitry that is already laid down 
on the FPGA. What this means, in the case of the fitness function, is to provide a state 


machine that will load a specified register on each clock cycle. Figure 25 demonstrates 
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how this can be accomplished. In this case, the fitness function is substituted by a shift 
register to show how the different registers can be loaded with an expected value on each 


clock cycle. 
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Figure 25 Circuit reutilization 


This circuit operates by three main processes. The moore34 module is a state 
machine. Its output controls the 4 to 16 decoder with enable circuit. The decoder circuit 
operates as a one hot decoder, if its enable bit is set, otherwise, all of its outputs are 0. Its 
outputs are used to enable one of 16 registers that are loaded by a common function, in 
this case a shift register. The shift register is initialized on the CLR signal from a stateful 
macro. A discussion of stateful macros is included in Appendix A. The state machine 


that was previously mentioned is described in Figure 26. 
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Figure 26 Reutilization state machine 





The basic premise of the state machine can be determined by examining the bit 
code for each state as shown by the number inside each state. The MSB is inverted and 
applied to the enable line on the decoder. The remaining four bits are then directly 
applied to the decoder to determine which register needs to be loaded. When the state 
machine receives its clear signal, it aligns itself such that the next transition will cause 


register 0000, to be loaded. Each successive register is then loaded until all have been 


loaded. The machine then loops until the next CLEAR signal is received causing the 
machine to reset. When this machine is implemented with an actual fitness function care 
must be taken regarding the latency of the fitness function. Its latency must match how 
long the machine cycles in an initial delaying state until a fitness value is propagated 


through its pipeline and is ready to be loaded onto an element register. 
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2: Random Access to ROMs 


In order to provide greater variation in program execution, the aforementioned 
ROMs should not be accessed sequentially, but instead as randomly as possible. The 
method that was tested to implement this is through a 32-bit cyclic-redundancy check 


(CRC). 


To provide another element of randomness, the value that was calculated via its 
CRC is calculated by a random number from main.c and added to a timer value from 
subr.mc. Because accessing the timer prevents pipelining within a loop, the 
aforementioned sum is incremented during each loop iteration. This provides a 


randomization of accessing the ROM. 


Because of the fixed width nature of the data whose CRC is being calculated, a 
table lookup CRC was implemented. This allows calculation of the CRC one byte at a 
time, thus reducing by a factor of four the computation time when compared to a CRC 
calculation done on a bit-by-bit basis [11]. The CRC lookup table was generated by C 
code using the method described by [20]. CRC calculation was performed by unrolling a 
CRC calculation loop of 32-bits and translating the resultant C code into Verilog. This 


circuit is realized in Figure 27. 


See ee go a 


Figure 27. CRC circuit 

















D. SUMMARY 


This chapter describes GAs, with a focus on how they work. The implementation 
of a GA on the SRC-6 is also given. The weight problem discussed in the chapter is 
fundamental to the GA that is implemented in this thesis, and will be referred to in 


subsequent chapters. The next chapter discusses the discovery of bent functions. 
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IV. BENT FUNCTION DISCOVERY 


A. OBSERVATIONS 


1. Co-function Repetition 


An analysis of the bent functions on n=4 suggests that co-functions might be a 
means to construct some bent functions. When the 896 bent function on n=4 are 
analyzed by co-function, it is revealed that there are 112 unique co-functions. By 
definition, there are two positions where a co-function can exist, the high and the low 
position. Each of the co-functions occurs 8 times in the high and 8 times in the low 
positions. This same analysis was conducted on n=6 with similar results. In this case, 
among the 5,425,430,528 bent functions on 6 variables there are 14,054,656 unique co- 
functions. Each of the co-functions occur the same number of times in the high and low 


position. We propose the following: 


Conjecture: All co-functions of n-variable bent functions occur the same 


number of times in the high and low position among n -variable bent functions. 
2. Index Runs 


It has been shown that there are 42,386,176 bent function A-classes on n=6 
[12]. We have verified this on the SRC-6. All of these bent functions were enumerated 
using the degree method described in section II.C.4. Since there are 2°' bent functions 
in each A-class for n=6, there are a total of 5,425,430,528 6-variable bent functions. 
During this, an interesting series of observations were made. First, it is important to 


understand how the mapping is accomplished. The initial mapping formula is derived 


from the highest degree of a bent function is 5 and is shown in Tables 10 and 11 [13]. 
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Table 10. Degree mapping, n =6, Map A, part | 


| Degree | s{4[4[ 3 4]3}3]2[4[3]3]2}3}2}2{ifa{s{ si 2}3}2}2}ifatetetilatitiso| 
[ ANF bit | 31 | 30] 29 | 28] 27 | 26] 25] 24] 23] 22 21 | 20] 19] 18] 17] 16 15] 14] 13] 12 ary 10] 9 | 8 V7N 6 Vs¥ 4 (3 2] 1 J 0 | 





[index | x] x] x] 25} x | 26] 27] s | x [28] 29] 6 [3of 7] 8] x] x] 31} 32] 9 Raz, sof a] x Aaa 23h x Ng x] x] x] 


Table 11. Degree mapping, n=6, Map A, part2 


The first row in the tables corresponds to the degree of the function based on a bit 
being present in that position. The second row indicates which bit is being referred to. 
The bottom row is an index to distinguish between second and third degree functions. 
For ease of reading, the second degree bits are in red, and the third degree bits are in blue. 


Those bits corresponding to degrees other than 2 and 3 are indicated by a black ‘x’. 


Looking solely at the bits corresponding for the second degree functions, we 
notice the first bit is at location 48 and the second at 40. Their corresponding indices are 
Q and 1. This pattern continues until the last a degree function, located at bit 3, is 
assigned the index of 14. Similarly, the an degree functions begin by again starting from 


the MSB and working right. Thus, bit 56 has index 15, and 52 has index 16. 


The following short hand is introduced to describe the mapping. For example, in 
Tables 10 and 11, the 2"! degree functions are mapped as 314, 513, as shown by red 
circles. Likewise, the 3“ degree functions are mapped as 7334, 11333, as shown by 
blue circles. The map shown above shall be referred to as Map A. Different patterns 
have been noticed by changing the mapping order. For example, using the above short 
hand, a new map can be defined as 2": 334, 5433 and 3": 7319, 11918. This 
mapping shall be known as Map B. Map C is described as 2": 30, 531 and 3™: 
7315, 11416. Map D is described, as 2™. 3520, 5>21 and 3: 730, 1191. Finally 


a random map was used. These mappings are shown in Tables 12 through 19. 


| ANF bit | 63 | 62| 61 | 60] 59] ss] 57] s6| ss] 54] 53] 52] 51] so] 49 as 47| 46] 45] 44] 43 | 42 41 | 40] 39] 38] 37] 36] 35] 34] 33 | 32 
| index | x] x]x]xfx}x]xfotx[ x] x]r] xf 2] 3]rof x] x] x] 4] x} s] ofa «[ 7] 8] 22] 9] 23] 24] « | 





Table 12. Degree mapping, n =6, Map B, part 1 
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Table 13. Degree mapping, n =6, Map B, part 2 


| ANF bit | 63 | 62] 61 | 60] 59 | 58 | 57] 56] 55] 54] 53] 52 51] so] 49] 48 | 47| 46] 45] 44 43| 
[| index | x] x[x]x}x}x[ x [34] x] x] x] 33] x [32] sr] ia] x] x] x [30] x | 29] 28] 13] x J 27] 26] 12] 25] 1] 10] x J 





Table 14. Degree mapping, n =6, Map C, part 1 


| Degree | 5] 4] 4] 3} 4/3] 3/2] 4/3] 3] 2]3/2/2]if4]3}3{2}3]2}2tii3at2tatil2}itilo| 
| ANF bit | 31 | 30| 29] 28] 27 | 26| 25 | 24] 23| 22] 21 | 20] 19] 18] 17] 16] 15] 14] 13] 12Vis\ 10] 9 | 8 V7\ 6 Vs\ 4 73\ 21 1 | 0] 
[| index | x] x] x] 24] x] 23] 22] 9] x [21] 20] 8 Jif 7] 6] x] x ]as}i7] s Nf 4 | 3 | x Usd 2 Kd x Kod x | x] x] 





Table 15. Degree mapping, n =6, Map C, part 2 


| ANF bit | 63 | 62 61 | 60] 59 | ss 57] 56] 55] 54] 53] s2| 51 so] 49 | 48 | 47] 46] 45] 44] 43 | 42 | 41 | 40] 39] 38 37] 36] 35] 34] 33 | 321 
| index | x] x[x]x}xfx[ x fro] x [x] x fast x [7] so] say x] x] x fas} x | 14] 13] 33] x J 12] af 32] 10] 31] 30] x J 





Table 16. Degree mapping, n =6, Map D, part 1 


| Degree | 5] 4] 4] 34/3] 3/2} 4/3] 3}2}3{2/2}ifai3}3}2}3]2}2ti[3t2tatil2titi lol 
| ANF bit | 31 | 30| 29] 28] 27| 26| 25 | 24] 23| 22] 21 | 20] 19 18] 17] 16] 15] 14] 13] 12Vis\ 10] 9 | 8 V7\ 6 Vs\ 4 3) 211 | 0] 
| index | x] x] x] 9] x] 8] 7} 29] x[ 6] 5] 28] 4} 27] 26] x] x] 3] 2 | sry 24] 23] x hod 22a x Raof x] x] « | 


Table 17. | Degree mapping, n = 6, Map D, part 2 





| ANF bit | 63 | 62 61 | 60] 59| ss] 57] 56] 55] 54] 53] 52] 51 50] 49 | 48 | 47] 46] 45] 44] 43 | 42 | 41 | 40] 39] 38| 37] 36] 35] 34] 33 | 321 
| index | x] x] x] x}xfx]x] sx] x] x] so} x} 22] 3 [23] x] x] xfar] x] 8] tf 29] x] 7] as] 2] 6 J 24] 9 | « | 





Table 18. Degree mapping, n = 6, Random Map, part | 





Table 19. Degree mapping, n = 6, Random Map, part 2 


This mapping can be extended to higher values of n by using the following 
technique. To construct the mapping for n=7, the above table is duplicated and 
appended to the LSB of the original table. The degree value for the original table is then 
incremented by one and the ANF bits are renumbered in the same manner as the original 


table, while the duplicated partition is left unchanged. The index of a bit must be 
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renumbered to achieve unique values. Again, since this thesis only focuses on bent 
functions of even n, the case of n=7 is only presented as an intermediate step to 


achieve the mapping for n=8. 


Another method that can be used to construct the table is to recognize that the 
values in the degree row correspond to the number of ones in the binary form of the ANF 


bit. For example, 63,,=111111, has 6 ones in it, and corresponds to a 6" degree 
function. Likewise, 42,,=0101010, has 3 ones in its binary representation 


corresponding to a ae degree function. 


As previously discussed, this method of mapping the functions by degree allows a 
search to be conducted only on those functions that can be bent. However, as shown in 
the above table, this still produces a large search space, 2°. We know that there are 
42,386,176 ROTS bent functions on n=6 [12]. The SRC-6 has 6 On Board Memory 
(OBM) banks located on the MAP, each of which can pass 523,776 64-bit values 
between main.c and subr.mc. There is not enough OBM to allow storing all of the 
indices prior to returning them to the microprocessor. Thus, it will take main.c 14 calls to 
subr.mc in order to retrieve all of the indices representing bent functions. The issue of 
transferring the indices is further complicated when considering the additional code 
required to distribute the indices across the six OBMs. Although striping the data across 
multiple OBMs will make the code more efficient, a less complicated process was chosen 
to process all of the indices. To overcome this limitation, the search space of visas 
broken into 512 equal parts, called partitions. This number of partitions was chosen to 
allow a large amount of the loop to be performed on the FPGA, while still having a 
reasonable assurance the partitions produced a small enough set of bent function indices 
that they would not exceed the capacity of the OBM. This is done by creating a loop in 
main.c which called subr.mc. The loop is over the range of 0 to 511. Subr.mc has its 
own loop that makes 2/2? =2” calls to a macro. This macro acquires 9 bits from 
main.c (passed through subr.mc) and 26 bits from subr.mc to form a 35 bit number that 
represents the ANF of a 3™ or 2™ degree 6-variable function, whose nonlinearity is 


computed in the macro. 
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Upon initial execution of the program, and the subsequent data analysis, it was 
noticed that bent functions commonly occurred as consecutive indices of the 35 bit ANF. 
For example, suppose the index 656 is discovered to be bent. This is to say, that when 
the number 656 is applied to the mapping tables above, the resultant ANF is a bent 
function. The next index, 657, was also discovered to be bent. To completely examine 
this, a program was written to analyze the indices of bent functions for those cases where 
consecutive indices yield bent functions. These observations introduce two new terms, 
index adjacent (IA) bent functions and an IA group. Two bent functions are IA if their 
index differs by one. A collection of consecutive IA bent functions form an JA group. 
The length of an IA group is the number of IA bent functions in that group. A future GA 
may take advantage of this by searching for bent functions by degrees. In doing so, 
mutation could be implemented by adding one to the index. By doing so, on a bent 


function, it is possible that the resulting function will also be bent. 


Figures 28 and 29 and Table 20 are produced through the analysis of Map A. 
From these figures and table, we can make some interesting observations. First, note that 
there are no JA bent functions in partitions above 31. Figure 29 shows how the IA groups 
are distributed among the 512 partitions. For example, Figure 28 shows that 123,045 
bent functions occur in Partition #0, the first partition. Finally, Table 20 shows a table of 
the length of the IA groups. Second, which only holds true for this experiment, is that all 
runs exist in the pattern of 2”—1,1<n<4. Figures 30 through 35 and Tables 21 through 


23 show the corresponding representations for Maps B, C and D. 
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Figure 28 JA bent functions, Map A 
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Figure 29 JA groups, Map A 
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Figure 30 JA bent functions, Map B 
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Figure 31 IA groups, Map B 
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Table 21. | Frequency of group length in all partitions, Map B, n =6 
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Figure 32 JA bent functions, Map C 
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Figure 33 JA groups, Map C 





Table 22. Frequency of group length in all partitions, Map C, n= 6 
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Table 23. Frequency of group length in all partitions, Map D, n=6 


More interesting results are obtained through a random process. A C++ program 
was written that takes the bits that are mapped to an index and puts them into a data 
structure. The ordering of the elements in the data structure is then shuffled. The first 
element is assigned the next sequential index, and then removed from the data structure. 
This process continues until the data structure is empty. The Figures 36 and 37 and Table 


24 show the distribution of the IA bent functions groups and their corresponding lengths. 


These results show several things of interest. First, is that each partition run has 
at least one IA bent function group. Closer analysis shows that the minimum number of 
IA groups in a partition run is 8. In all of the other maps, at least one partition has no IA 


groups. The other item of interest is that all IA groups have a length of either 2 or 3. 





IA Bent Functions, Random Map 

















Number 













| 





TA 


Da dd be 





| 
| 
: 





100 i , an 
A TT | 
0 
[SoM oO MON MN Ore FR OO MOAN HON Dr tTre OY CO 
ue ee See te es ee ee eee & 
aoe. 


Partition Number 





a Number of lumped functions 




















Figure 36 IA bent functions, Random Map 
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Figure 37 IA groups, Random Map 





Table 24. Frequency of group length in all partitions, Random Map, n =6 


An analysis of this data yields two conclusions. The first is that a future GA may 
be able to incorporate this process in finding bent functions. Suppose the chromosome is 
the random index mapping to the ANF representation of a function. If mutation is 
defined as the addition of a bit to this index, it is possible that one bent function could 
mutate into another bent function. Additionally, it may be practical to use the random 


index map on the index mapper in order to find a bent function on n=10. 
Based on the figures and tables above, the following term is introduced: 


# partitions with IA bent functions 





TA concentration = # of IA groups x — 
#partitions 
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Table 25 shows that the IA concentration is highest for the random map. This is of 
interest on how to choose the mapping for a GA that implements a function’s ANF as a 
chromosome. If mutation is implemented as adding 1 to the index, the mutated 


chromosome of a bent function may also be bent. 


| __ HlA concentration 
2,004.1 
9,070.3 


2,067.0 
15,701.8 
Random 27,048.00 





Table 25. JA concentration on n=6 


Figures 38 to 42 show the bent function distribution per partition for the five 
different maps use. The first item of note is in Figures 38 and 40. In each of those tests, 
the first partition contains considerably more bent functions than the others. The second 
item is visible in the figures, and verified in Table 26. Table 26 shows the standard 
deviation of the number of bent functions per partition. The random distribution shows 
that it may be more practical to find bent functions using the random map when 
examining problems with more variables. This is based on the observation that as the 
number of variables goes up; the scarcity of bent functions goes up even more. It is 
proposed that since the random map has the lowest standard deviation, the existence of a 


bent function in a partition for n=8 or n=10 is more probable. 


57 








Number of Bent Functions 


500000 
450000 


400000 + 


350000 
300000 
250000 
200000 
150000 
100000 
50000 
0 


Bent Function Distribution, Map A 


110 
132 
154 
176 
198 
2 
4 
6 
8 
0 
330 
352 
374 
396 


Partition Number 


484 








Figure 38 Bent function distribution, Map A 
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Figure 40 Bent function distribution, Map C 
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Figure 41 Bent function distribution, Map D 
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Figure 42 Bent function distribution, Random Map 


Table 26. 


B. 


BITSTUFFING 


Standard 
deviation 


22,204.96 





Bent function standard deviation per partition 


From the weight property of bent functions, we know that a bent function has one 


of two predetermined number of ones in it, for a given number of variables. Because of 


this, we can construct a function with the appropriate number of ones in it, with the 


miniscule hope that it is bent. This process shall be known as bitstuffing. Although there 


are several properties of bent functions, that have not been discussed, that can improve 


the probability construction of a bent function, this discussion will focus on the ROTS 


functions. 
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1. The Ones Hypothesis 


Recall that there are eight 4-variable ROTS bent functions. Each of the eight sub- 


tables of Table 27 represent all of the 4-variable bent functions that can be created from 


the 4-variable ROTS bent functions by exclusive ORing a ROTS bent function with an 


affine function. Note that the resulting function is bent, but not necessarily ROTS. The 


first sub column represents a function that has been determined to be bent and ROTS 


through a brute force enumeration of the search space. The second sub column is an 


affine function. The final sub column is the result of the exclusive ORing of the 


candidate ROTS bent function and that particular affine function. 


Fn 

0536 
0536 
0536 
0536 
0536 
0536 
0536 
0536 
0536 
0536 
0536 
0536 
0536 
0536 
0536 
0536 


177E 
177E 
177E 
177E 
177E 
177E 
177E 
177E 
177E 
177E 
177E 
177E 
177E 
177E 
177E 
177E 


6CA0 
6CA0 
6CA0 
6CA0 
6CA0 
6CA0 
6CAO 
6CA0 
6CA0 
6CA0 
6CA0 
6CA0 
6CA0 
6CA0 
6CA0 
6CA0 


Afn Result 


Table 27. 


Fn 

TEES 
TEES 
TEES 
TEES 
TEES 
TEES 
TEES 
TEES 
TEES 
TEES 
TEES 
TEES 
TEES 
TEES 
7EE8 
TEES 


8117 
8117 
8117 
8117 
8117 
8117 
8117 
8117 
8117 
8117 
8117 
8117 
8117 
8117 
8117 
8117 


935F 
935F 
935F 
935F 
935F 
935F 
935F 
935F 
935F 
935F 
935F 
935F 
935F 
935F 
935F 
935F 


ROTS bent functions on n = 4 


Afn 

FEFF 
AAAA 
cccc 
6666 
FOFO 
5A5A 
3c3c 
6969 
FFOO 
55AA 
33cC 
6699 
OFFO 
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Result 
8117 
D442 
B224 
188E 
8E18 
24B2 
42D4 
1781 
81E8 
2B42 
4D24 
1871 
7118 
244D 
422B 
E881 


8117 
D442 
B224 
188E 
8E18 
24B2 
42D4 
1781 
81E8 
2B42 
4D24 
1871 
7118 
244D 
422B 
E881 


6CAO0 
C60A 
AO6C 
OAC6 
9C50 
c3905 
509C 
o5c9 
93A0 
390A 
A093 
0A39 
6350 
3605 
5063 
0536 


Fn 
E881 


Result 
E881 
422B 
244D 
7118 
1871 
4p24 
2B42 
81E8 
1781 
42D4 
24B2 
8E18 
188E 
B224 
D442 
8117 


0536 
5063 
3605 
6350 
0A39 
A093 
390A 
93A0 
05c9 
509C 
c905 
9c50 
OAC6 
AQ6C 
C60A 
6CAO 


This implies that a sieve can be easily constructed to help search for the bent 
functions. Furthermore, this sieve can be constructed to target ROTS bent functions. 
Suppose you have a function you would like to determine is bent. An initial step could 
be to count the number of ones in it, and for the case of n=4, see if the answer is either 6 
or 10. If this is not the case, then the function is not bent. If this is the case, then it may 
be bent, but there is no guarantee that it is bent. The remainder of this discussion will 


focus on the case of a bent function containing 6 ones. 


The goal is to determine if a ROTS function has 6 ones. To start, the function that 
maps the rotationally symmetric index (RSJ) to the truth table must be examined. One 
lab during EC4820 involved the creation of a C program that determined the rotationally 
symmetric index function. This paper will only focus on those results. The below table 


demonstrates how the 6-bit RSI is mapped into a 16-bit truth table. 


To create the 16-bit number from the RSI, each of the 6-bits of the RSI is 
examined one at a time. The RSI and the 16-bit truth table are formatted with the bits 
laid down with the 0 bit being on the right. The value of the i" bit of the RSI is then 
mapped into the 16-bit number according to the below table. This means that the value 
of bits 8, 4, 2 and | in the expanded number are the same and equal to the value of bit 1 


in the RSI. 


Next, the number of ways to create a 16-bit number from the RSI was examined. 
This was done via a histogram on the bin frequency of the RSI in an Excel spreadsheet. 


Table 28 shows these results: 
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Source bit Frequency 
0 [ 
1 4 
2 4 
3 2 
4 4 
5 1 














Table 28. RSI histogram, n=4 


In this table, a value from each of the 6-bits in the RSI is used the number of 
times listed in the frequency column. From this, it can be shown that there are six ways 
to get a number with 6 ones. To get a number that has 6 ones in it, each source bit must 
be set high until the frequency total is 6. Thus, the source bit combinations that are 
required to get 6 ones is 015, 025, 045, 13, 23 and 34. However, since there are only 4 
ROTS bent functions on n=4, we know that 2 of them must be eliminated. Table 29 
shows the 6 ways to get 6 ones in the 16-bit format. The two functions that are not bent 
are highlighted in red. Because not all functions generated through this process are 


ROTS bent function, it is only a sieve to construct ROTS bent functions. 


In Table 29, the ‘“‘selected source bit’ column shows which bits in the RSI must be 
high to get 6 ones in the 16-bit format. The next 4 columns labeled 15, 11, 7 and 3 
represent each of the 4 nibbles in the 16-bit number. Each column is labeled according to 
the MSB in each nibble. The final column is the 16-bit hexadecimal representation of the 
number being constructed. The bolded red numbers indicate RS functions that have 6 
ones but are not bent. This can easily be proved by finding an affine value that, when 
exclusive ORed with the function, produces a result with less than 6 ones. For0x9249 , 
the affine value is 0x9669 (0x9249 © 0x9669 = 0x0420 = 0000010000100000, ) and is 


determined through a brute force enumeration of the affine functions. For0x1668 , the 
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affine value is also 0x9669 (0x1668 © 0x9669 = 0x8001 = 1000000000000001,). The 
bent functions are now known to be 0x536,0x6CA0,0x8117 and OxE881 through 


























enumeration. 
Binary representation 
Selected source aaa 
bit 15 11 7 3 

015 1000 0001 0001 O111 8117 
025 1001 0010 0100 1001 9249 
045 1110 1000 1000 0001 E881 
13 0000 0101 0011 0110 0536 
23 0001 0110 0110 1000 1668 
34 0110 1100 1010 0000 6CA0 

















Table 29. RSI functions with 6 ones 


This same concept can be used to produce a sieve for larger cases of n. The case 
implemented for this thesis was for n=6. This results in a 14-bit RSI being mapped into a 
64-bit number. The RSI mapping function has been previously discussed under the guise 


of the 28 pound packing problem discussed in III.B. 
Ze Execution of a GA on n =6 


The GA has been implemented on the SRC-6, as previously described in this 
thesis. The results of the data trends from the SRC-6 are discussed below. As will be 
described, different parameters of the GA were altered in order to see how the results 
varied. However, for the sake of consistency throughout all of the experiments discussed, 
the same set of seeds is used for the LFSRs. Additionally, all of the ROMs were 
addressed in a sequential manner, versus the CRC addressing method previously 


described. 
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The following terms will be used throughout the discussion of the GA results. 
Since the GA implements the ones’ count hypothesis, it is a sieve for bent functions. 
What that means is that a “fit” chromosome from the GA, meaning it has the maximum 
fitness value of 240, may be or may not be bent. The term “percent fit” describes how 
many of the chromosomes out of all of the generations have a fitness of 240. If a 
chromosome has a fitness value of 240, and its corresponding truth table is a bent 
function, then it is referred to as a “chromosome yielding a bent function”. The “yield” is 
the percentage of all chromosomes with a fitness of 240 whose TT yields a bent function. 
Bent functions are only counted once regardless of how many times they generated 
through the chromosomes, and are thus referred to as “unique”. Unless specified 


otherwise, the minimum fitness value to prevent chromosome replacement is 150. 


Several methods were used to implement the crosscode. Unless stated otherwise, 
the crosscode of OxlF is used. This specifies the boundary between the bins 
corresponding to a weight of 6, and all other bins. This is a single point crossover. Later, 
two types of random crosscodes are tested. The first type is one that has the crossover 
occur at one place within the chromosome. The other type allows for multiple crossover 
points. In all cases, the same crosscode is used for all elements in a particular generation. 


In some cases, described in detail later, each generation has its own crosscode. 


The first set of tests shows effects of changing the minimum fitness value. There 
are essentially three interesting things to look at in the results of when the minimum 
fitness value is changed. The first is when the minimum fitness value is 0, the GA 
behaves more like a GA in which new chromosomes are only introduced through 
crossover. Conversely, when the minimum fitness value is 240 the GA behaves like a 
brute force search, albeit with some “genetic” aspects to the brute force. Finally, the 
remaining interesting point is where the GA seems to change its success rate. This first 
set of tests was run under two generation lengths, 512 and 2,000. The former choice is 
based on 16 chromosomes per generation for 512 generations yields 8,192 chromosomes 
examined. This is half of the total chromosomes in the entire search space. The later 
choice is an arbitrary larger value. Figure 43 shows the number of unique bent functions 
found, while Figure 44 shows the number of chromosomes yielding a bent function. 
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Figure 44 Chromosomes yielding bent functions 
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Of greatest interest here is the distinct jump in the number of chromosomes 
yielding a bent function between the minimum fitness of 120 and 150 as seen on the 512 
generation test. This result is also shown in Figures 45 and 46, which show the percent 
fit and the yield for these tests. Because of these results, the minimum fitness of 150 was 


chosen as the minimum fitness of subsequent testing. 





Effects of changing minimum fitness 
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Figure 45 Percent fit versus minimum fitness 
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Effects of changing minimum fitness 
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Figure 46 Yield versus minimum fitness 


Figures 47 and 48 show the effect of changing the crossover point on the number 
of chromosomes yielding bent functions, and also the number of unique bent functions 
located. There are five sets of tests in this experiment, with each test running for 512 
generations. The first test is the control test, with a crossover point of OxlF as described 
with Figures 43-45. The second test uses a multi-point crossover with code OxEDO. 
Multi-point crossover means that instead of one point determining where the 
chromosomes are split and recombined, there are several points. For example, the multi- 


point crosscode 0xEDO=1110_1101_0000, means that the bits 4, 6, 7, 9, 10 and 11 


will crossover, where bit 0 is the LSB. The third test is a single point crossover with 
crosscode 0x7. These two crosscodes were used in all generations. The final two test 
sets each have a unique crosscode for each generation. In the first case, the crosscode is a 
multi-point, and in the last case, it is a single point crossover. The most obvious piece of 
information gained from this set of experiments is that, with the current design of the 


chromosome, and fitness function, the success of the GA is definitely dependant on the 
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crossover code. This can be seen in two places. The first is on the case where the 
minimum fitness value is 120. By the design of the fitness function, 120 is a key point 
since it represents that 1 of the 2 subproblems has been “solved”. The other issue is the 
randomly generated multi-point crosscode yields some interesting results when the 
minimum fitness value is 150. The correlation between the crossover code and the 
minimum fitness value appears to be dependant on the minimum fitness value. As it gets 
higher, different crosscodes yield better results. This may be due to the GA starting to 


resemble a brute force attack as the minimum fitness value approaches 240. 
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Figure 47 Effect on number of unique functions due to changing the crossover point 
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Figure 48 Effect on number of chromosomes yielding bent functions due to changing 


Figures 49 and 50 show the effect of changing the crossover point has on percent 
fitness, and the percent yield. Overall, each of the crossover methods has a tendency to 
yield more fit chromosomes as the minimum fitness value rises. The yield is somewhat 
affected, although the values are too small to draw any definitive conclusions, with the 
exception of a few data points that show a marked change in the performance of the 
crossover point with respect to the minimum fitness value. The most notable points are 
shown on Figure 50. The test single point change has its percent yield drop go from 
4.77% to 0.73% as the minimum fitness changes from 90 to 120. On the multipoint test, 


the yield goes from 0.62% to 8.87% to 0.53% as the minimum fitness value changes from 


120 to 150 to 180. 
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Figure 49 


Percent versus minimum fitness for a changing crosscode 
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Effects of changing crossover point 
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Figure 50 Yield versus minimum fitness for a changing crosscode 


The final set of tests involves the effects of changing the number of generations. 
Again, the original crosscode of OxlF is used. Three cases are examined for the 
minimum fitness value. Again, the minimum fitness is varied in increments of 30 from 0 
to 240. Figures 51 and 52 show some interesting results. The first is that the number of 
unique bent functions becomes saturated very quickly. In essence, this “saturation” is 
actually a limit indicating that all of the ROTS bent functions on n=6 that could be 


found were found. That statement has its basis in the fact that this GA is only searching 
ye: 


for ROTS functions that have 28 ones in it. This process will only locate half of the 
ROTS bent functions, since only half of them have 28 ones in them. Their complements, 
however, contain the other half of the ROTS bent functions on n=6. This means that 
for all cases, running the GA for more than 6,000 generations does not yield any 
additional bent functions. Prior to that threshold, there is a linear relationship between 
the number of unique bent functions and the number of generations. The number of 
chromosome yielding bent functions behaves drastically differently, though. As the 
number of generations increase, there is nearly a logarithmic rise in the number of 


chromosomes yielding bent functions. 
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Figure 51 Unique bent functions versus changing generations 
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Figure 52 Chromosomes yielding bent functions versus changing generations 


Figures 53 and 54 show the relationship between the percent fit and the yield. In 
all cases for the percent fit, there is little change once 2,000 generations have elapsed. 
There is an inverse relationship between the yield of the GA and the minimum fitness 
value versus the number of generations. For example, when the minimum fitness value is 
0, the yield grows as the number of generations grows. The converse is true for a 
minimum fitness value of 240, as the number of generations grows, the yield diminishes. 
This is due to the fact that with a minimum fitness value of 240 the GA starts to emulate 
a brute force attack. However, since chromosomes are eliminated through selection, and 
chromosomes that do not have a perfect fitness are also eliminated, this case actually 
behaves worse than brute force. The yield for the minimum fitness value of 150 is 


relatively unchanged. 
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Figure 54 Yield versus number of generations 


Finally, a test was conducted to determine the effectiveness of crossover on 
producing bent functions. This analysis is limited to the tests involving 512 generations. 
As more generations elapse, the LFSRs produce the set of pseudorandom numbers that 
rapidly covers the entire search space. This can be accomplished in as little as 1,024 
generations if the LFSR seeds are strategically space, which they are not. Regardless, it 
becomes extremely difficult to determine if a chromosome is the resultant of it being 


generated by the LFSR, or by crossover. 
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This process starts by storing all of the values that the LFSR will generate over 
512 generations into a text file. This text file is then read, and its values are stored into a 
C++ set data structure. This type of data structure can be viewed as a specialized vector 
in which no two elements have the same value. It should be noted that the underlying 
data structure in a set is probably not a vector; a vector is merely mentioned due to its 
familiar nature. Next, the chromosomes with a fitness value of 240 are loaded into a 
different set. Each element of the first set is searched for in the second set, and removed 
if found. The elements remaining in the second set after this process are those that must 
have been generated through crossover. It is possible that some fit chromosomes were 
generated through crossover that were also generated by the LFSRs, and are thus being 
masked. However, given the results of Figure 55, a sufficient number of fit 
chromosomes are being generated through crossover that further investigation on the 


success of crossover is not warranted. 


Figure 55 shows the percentage of all fit chromosomes that could not have been 
produced by the LFSRs during the run of 512 generation. This figure shows that 


crossover is indeed being effectively used to generate fit chromosomes. To be specific, 


fully vA to Y, of the fit chromosomes came from crossover. Actually, it cannot be 


determined if they came from crossover or mutation. However, by design, mutation is 


already sufficiently rare and ineffective that its contribution can be neglected. 
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Figure 55 Fit chromosomes due to crossover 
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C. ROTS SEED ON n=4 


To assist in reviewing the bent functions on n=4, a computer program was 
written that serves as a filter for the bent functions. The user is able to specify which of 
the nibbles he wishes to examine, and those bent functions that meet that requirement are 
shown. From this, it was observed that a given co-function exits numerous times across 
all of the bent functions. To further process this information, another program was 
written that specifically tallied all of the co-functions, and their respective frequency in 
both the high and the low position. These results yielded an interesting observation. 
There are 112 unique co-functions on n=4. Each of those co-functions appears 16 
times. From this it is hypothesized that bent functions of a certain number of variables 


can be constructed using co-functions of the same number of variables. 


After an extensive comparison of the co-functions with the affine functions, the 
following algorithm is given to find bent functions on n=4. It is proposed that this 


algorithm be implemented to cover larger n. 


First, all of the affine functions are loaded into a C++ set data structure. Next, all 
of the given bent functions are loaded into a different set. The bent functions that exist 
during the initial execution of this algorithm are known as the bent function seed. Later 


discussion will show how ROTS bent functions are ideally suited for this process. 


The set of bent functions is parsed, with each bent functions being broken into its 
two component co-functions, with the high co-functions being logically shifted to the left 
to align the co-functions. Each of those two co-functions are added to a co-function set. 
Additionally, the bits of each co-function are reversed, and those new values are added to 
the co-function set. For example, suppose the set contains the co-function 


0x74 =01110100,. The number 0x2E = 0010 1110, is then added to the set. 


Second, a nested loop is constructed. Each loop iterated through the entire 
co-function set. While doing so, the current co-function from each loop is exclusive 
ORed, and the resultant value is placed in a temporary co-function set. After the loop 


completes, the temporary co-function set is merged with the original co-function set. 
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Third, a set of proposed bent functions is produced using a manner similar to 
construction of the temporary co-function set. Again, a nested loop is constructed with 
each loop iterating through each member of the set. A proposed bent functions is 
constructed by assigning the co-function in the outer loop as the high co-functions and 


the inner loop as the low co-function. 


Finally, the nonlinearity of the proposed bent functions is calculated, and a new 
set of bent functions is produced. This new set of bent functions provides the input to the 
aforementioned algorithm to again produce a set of functions to be tested for bentness. 


This process was implemented using a Linux shell script. 
Observation: 


Only two functions are required to be in the seed, provided that they are both 


ROTS and of a different A-class, in order to generate all 896 bent functions. 


This can be accomplished in two iterations of the algorithm. Different 
combinations of seeds were constructed utilizing a ROTS bent function and a bent 
function of a different A-class. Although the combinations were not exhaustively 
searched, it is observed that all 896 bent functions can be generated from a given pair of 


seeds. Further iterations of the algorithm fail to produce additional bent functions. 


In a sense, this is actually a genetic algorithm, albeit with the order of genetic 
operations different from those previously discussed in this thesis. Consider each co- 
function from the bent functions as the chromosomes. The operations of combining the 
co-functions with themselves, and the affine functions, could be viewed as a form of 
crossover. Survival of the fittest is mimicked through the nonlinearity calculation. When 
the nonlinearity is determined, those functions that are not bent are removed from 
consideration because they are not “fit”. Thus, in this case, “fitness” is determined by the 
nonlinearity of the function. In this test the mutation operation is not implemented. Each 
iteration of the algorithm is a generation. After a few number of generations, no 


additional information is gained. 


The nonlinearity distributions are shown in Tables 56 and 57 for n=4. This is to 


provide a comparison against the nonlinearity distributions of higher n. 
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Figure 57 ROTS nonlinearity, n=4 
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D. BY DEGREE ON n=6 


As mentioned in IV.A.2, all of the bent functions were enumerated on n=6 from 
the set of functions with degree 3 or 2. Figure 58 shows the graph for the nonlinearity 
distribution observed from these functions. Figure 59 shows the nonlinearity distribution 
for ROTS functions on n=6. Of particular interest is that only the following 
nonlinearities were observed: 0, 8, 12, 14, 16, 18, 20, 22, 24 and 28. The distributions 
for nonlinearities of 0, 8, 12 and 14 are 1, 11,160, 1,749,888 and 22,855,680 respectively. 
The remaining nonlinearity frequencies are so large that the previous nonlinearities are 


not noticeable in the figure. Note that the two graphs share the same basic shape. 





Nonlinearity by Degree, n=6 
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Figure 58 Nonlinearity by degree, n = 6 


81 





ROTS Nonlinearity, n=6 
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Figure 59 ROTS nonlinearity, n = 6 


E. ROTS ON n=8 


The ability to conduct a nonlinearity calculation on n =8 is hampered by the ever 
growing size of the circuitry. In order to combat this, two optimization were made to the 
basic design for the nonlinearity calculation on n=6. The first was implementing the 
complement optimization described in Chapter II. By doing so, this reduces the number 
of nonlinearity values going into the minimization circuit by a factor of 2. This, however, 
does not reduce the complexity of the circuitry to allow a timely compiling. To reduce 
this complexity by another factor of 2, an additional control signal was provided to the 
macro from subr.mc. This control signal controlled a two input multiplexer, whose 
inputs were affine functions. To accommodate for the reduced data throughput in the 


macro, subr.mc is required to make two calls to the macro. 


Consider, for example, the need to count from 0 to 7, in binary. The proper 
pattern would be 000, 001, 010, ... 111. Now suppose you were to count from 0 to 7 


again, however, this time you need to count each number twice. This can easily be 
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accomplished by appending a counting bit to the original three bits. Thus, one is 
essentially counting from 0 to 15, and using the three most significant bits to indicate the 


number that is being counted. 


A loop is normally used to provide the macro with sequential indices, or 
sequential blocks of memory containing the functions to be tested. When the loop is 
providing indices to the macro, it can easily be doubled using the above method. This 
allows a smaller circuit to be placed on the FPGA, albeit at the expense of a slower 
execution time. Care must be taken when doing this on three steps. The first is ensuring 
that the index being sent to the macro in the slowed counter. Using the above example, 
you would need to be sending the macro 000, 001, ... 111 versus 0000, 0001, ... 1111. 
This can be accomplished by conducting a logical right shift by one bit. At the same 
time, the macro also needs to receive the control signal for the multiplexer in the macro. 
This can be accomplished by using a logical AND on a counter, and applying that 
resultant to the macro. Finally, subr.mc needs to compare the nonlinearity value from the 
most recent call to the macro, with the value from the previous call. The lesser of the two 


values is chosen and used as the overall nonlinearity for the function under test. 


As a result of the enumeration of the 2*° ROTS functions on n =8, the following 
observations were made. 15,104 of the functions are bent, and they are equally divided 
amongst 3,776 A-classes. That is, there are four ROTS bent functions in each of the 
3,776 A-classes. The nonlinearity distribution of the bent functions is shown in Figure 
60. Of the total number of ROTS functions, 2.20x10°% are bent. The distribution 
contains all nonlinearities although those below 52 and above 116 are not visible due to 


their small distribution. 
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Figure 60 ROTS nonlinearity distribution, n =8 


F. SUMMARY 


This chapter discussed the various methods used to locate bent functions. It also 
provides the reader with the necessary knowledge to understand the scarcity of bent 


functions, and the need to restrict the search space as much as possible. 
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Vv. SUMMARY 


A. BENT FUNCTIONS 


The method best used to find bent functions is based on the number of variables 
that is being examined. Table 30 shows the scarcity of the bent function. It introduces a 


new term called the concentration factor defined below: 


#bent in search space 





size of search space 
total # bent 


total search space 


cf = 








This value shows how changing the size of the search space, e.g. by restricting it to 
ROTS functions or through searching by degree, the concentration of bent functions 
becomes noticeably higher. This can become a means to determine which method should 


be used to locate bent functions for a higher n. 
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Table 30. Bent function scarcity, After [8] 


B. GENETIC ALGORITMS 


There is considerable potential in using GAs, and genetic based operations, in 
order to find bent functions. The most interesting method used a ROTS seed to find bent 
functions. When using the more traditional approach to GAs, one quarter (6 out of 24) of 


the available bent functions were located. This is of interest considering that the GA had 
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processed a total of 77 functions, some of them repeated due to the nature of the GA, that 
were bent during this time frame. Careful design of the GA, and crosscode selection, will 


help the GA locate bent functions more rapidly. 
C; WHY RECONFIGURABLE COMPUTING 


The primary advantage of reconfigurable computing is the ability to work in 
parallel. Consider the sieving computation for bent functions on 6 variables. The 
distance between each function and 128 affine functions must be computed. This can be 
done in parallel. If the GA is applied to a population of 16 chromosomes at one time, this 
can be considered as a parallel computation involving 16 separate processes acting 
simultaneously. Despite the relatively few chromosomes in the population for this GA, 
the reconfigurable computer operating at 100 MHz is quickly capable of outpacing a 


general purpose computer operating at 2.8 GHz. This can be seen in Figure 61. 


The FPGA based GA was translated into C++, and a comparison was conducted. 
The C++ code was executed on the same CPU that is connected to the FPGA, a 32-bit 2.8 
GHz Xeon processor. The C++ implementation includes two versions. The first does not 
include the half-life and Order 67 logic (C++ simplified), while the second version (C++ 
full) does. The graph shows the number of CPU clock cycles required to run the GA for 
a given number of generations. For example, the FPGA GA implementation requires 
fewer clock cycles compared to C++ full when the number of generations exceeds 
approximately 7,000. And, it requires fewer clock cycles compared to C++ simplified 


version when the number of generations exceeds approximately 14,000. 


This result can also be seen in Figure 62, which shows the accumulated CPU 
clock cycles versus the generation. For example, at approximately 7,000 generations and 
above, the FPGA GA requires fewer clock cycles than C++ full. At approximately 
14,000 generations and above, the FPGA requires fewer clock cycles than C++ 
simplified. The reason that the FPGA curve is relatively flat is because the amount of 
time it takes to run the GA is insignificant compared to the time required to execute the 
other instructions. This is similar to the difference in time it takes to list a large amount 


of files in Linux. For example, it takes considerably less time to list the files while 
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redirecting the output to a file than to display them to the screen. The number of clock 
cycles in all cases is determined by using the C “clock()” library function. Since this GA 
is able to quickly locate fit functions due to the small search space, the FPGA 
implementation of the GA may be better suited for a larger search space, meaning more 


bits in the chromosome. 
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Figure 61 Speed advantage of reconfigurable computing 
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Figure 62 Generations per CPU clock cycle 


D. MEETING GOALS 


The goal of this thesis was to determine if GAs are useful in finding bent 
functions. This thesis shows that GAs are a useful tool to locate bent functions. 
Furthermore, it shows that other genetic processes are useful for generating bent 
functions from a given bent function. For the first time at NPS we have enumerated all 
bent functions on n =6 and all ROTS bent functions on n=8. Finally, it was discovered 
that bent functions can be discovered in groups. This discovery may be useful in a future 


GA searching for bent functions by degree. 


E. FUTURE WORK 


Through the course of this thesis several issues were discovered that warrant 


additional research. This includes the grouping of index adjacent bent functions and 
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generating bent functions for a ROTS seed. Finally, many sub-programs were created 


that can be expanded and integrated to facilitate easier testing. 


1 ROTS Seed 


Additional testing should be done on the ROTS seed method of discovering bent 
functions. The test should focus on what combinations are required to yield all bent 
functions. Furthermore, analysis should be conducted on the mathematical properties of 


the seed functions that yield all bent functions. 


Furthermore, this experiment should be expanded to study functions of more 
variables. It is proposed that the next case to be examined is n=8 since all bent 
functions on n=6 have already been enumerated in this thesis. The addition of new A- 
classes of bent functions will provide more research opportunities than revisiting 


functions of fewer variables. 


It is proposed that the ROTS seed algorithm be modified to allow searching for 
bent functions in more variables. This can be accomplished by programming the co- 
functions in a hexadecimal character string versus as an integer value. In this method, 
co-functions can have their bits reversed by simply parsing the character string from the 
null terminating character to the first character in the string. As each character is parsed, 
its integer-based hexadecimal value can be computed, and corresponding reversed 
hexadecimal value be determined through an array based table lookup. Similarly, when 
two co-functions are exclusive ORed together, this can be done on a character by 
character basis of their representative hexadecimal strings. Again, the integer based 
values can be determined from each character. After the two values are exclusive ORed 
together, the resulting value can be used as an index into a different lookup table to yield 
the hexadecimal based character of the string. This is all easily accomplished in C++ 
using the Standard Template Library. Finally, the proposed bent function strings can be 
printed to a text file which can then already be read by an existing bent function 


calculator to determine the nonlinearity of the functions. 
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2. By Bitstuffing on n =10 


It is proposed that a ROTS bent function can be created through a similar process 
to the GA process on n=6. For this case, there are 108 bits in the ROTS index. Of 
these bins, 99 have a weight of 10. Since the goal is to create a function with 496 ones in 
it, one of the primary ways to do that is to have 49 of these 99 bins selected, thus giving a 
weight of 490. The remaining weight can then be evaluated through brute force 
manipulation of the remaining bins. Building a ROTS index that has exactly 49 ones in it 
can be done as follows. Create a data structure that has 99 elements in it. Initialize 49 of 
the elements to a 1, with the remaining to 0. Shuffle the position of the elements through 
some random process. If this process is to be implemented in C++ with the Standard 
Template Library, the random_shuffle algorithm can be used to accomplish this. The 
resulting list can then be converted into a couple of 64 bit numbers by iterating through 
the list and performing the appropriate bitwise operations on the required variables. This 
99 bit value is then processed by the FPGA along with the brute force manipulation of the 


remaining bits, and the nonlinearity is computed. 


In order to perform the nonlinearity calculation on n=10, a method similar to 
that used on n =8 can be used. In n =8, which affine function that is going to be used is 
controlled by a control line to a multiplexer. In n=10, which has four times as many 
bits in the TT, the number of controls lines becomes three. However, instead of a 
multiplexer being used to store the affine function values, a ROM is instantiated that 
contains all of the affine functions. It is instantiated 128 times. The value accessed by 
the ROM is composed of two factors. The first is a constant corresponding to the ROM 
number. For example, the first ROM would have seven of its address lines being tied to 
the number 0. Likewise, the last ROM would have seven of its address lines tied to the 
number 127. The remaining three address lines are provided from by the loop in the 
subr.mc file. This gives the ROM the ability to access 1,024 different affine functions. 
As with the case of n=8, a “shortcut” is used that takes into consideration the 
relationship that an affine function has with its complement on the nonlinearity of a 


function. 
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3. Rework on n=6 


The fitness function should be rewritten for n=6 to facilitate looking for GA 
chromosomes that correspond to a ROTS function that has 36 ones instead of 28. 
Another possibility is to include a version that searches for both conditions, and selects 
the best case. This can be accomplished by first examining the flowchart created for the 
current fitness function. The corresponding tables can be recomputed for the new set of 


circumstances that will result in correctly answering the subproblems. 
4. More Efficient Use of Memory Transfers 


In order to transfer information between the microprocessor and the FPGAs, a 
series of memory transfer protocols are used. Currently the algorithm is only using one 
of the data paths between the two computers. This limits the number of generations that 
can be run in one execution of the GA. Although this is not a factor in the current GA, 
the ability to run it over more generations may become necessary in other problems. 
Furthermore, the memory transfers can be sped up in a few places by utilizing different 


striping patterns as discussed in the SRC-6 literature. 
3. Calculators 


Currently a series of calculators are used to send data from one portion of the 
computation to another. For example, the GA is computed in one executable, and uses 
input/output redirection to send its data to a text file. The text file is, in turn, read into a 
different file that converts the chromosome into the normal ROTS index expression, and 
then into its subsequent truth table. Finally, another program performs the nonlinearity 
calculations on these truth tables. This cumbersome process is primarily hampered by the 
need to read text data as a hexadecimal character string and convert it into its numerical 
value. Furthermore, this is done in groups of eight before the data is passed to the FPGA. 
This is drastically underutilizing the amount of bandwidth that exists between the 
microprocessor and the FPGA. The decision to do this is based on the fact that 8 copies 
of the nonlinearity calculation can be placed on the FPGA at one time. Thus to simplify 


processing these calculations, the rest of the program was hindered. 
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These calculators can be rewritten in at least two ways. The first is to send more 
than 8 values to the FPGA for processing at one time. The most desirable solution would 
be to combine all of the programs into a single project. In this method, would be 
recommended for implementation only after all of the calculators are working 
independently. This is based on the considerable amount of time it takes to compile each 
of the separate programs. One variant of this would have several subr.mc files in the 
Makefile, each calling their own calculator. Another version would incorporate all of the 
calculators into one subr.mc file, and having a control signal from main.c determine 


which calculator is to be used. 
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APPENDIX A. STATEFUL MACROS SRC-6 


Of interest in this research is the case of a “stateful” macro. As previously 
mentioned, there are several different possible flags that can be set. One of these flags 
makes the macro stateful. This is the method that the SRC-6 designers created to 
implement macros that would retain their state from one iteration of a FOR loop to the 
next. In order to prevent the macro from continuing to execute when it should not be 
executing, additional control signals are made to this macro. The different ways to 
control these signals is beyond the scope of this research. Only their functional result is 


of interest. 


The three control signals that must be implemented for stateful macros are 
CLEAR, VALID and ITERATION (or ITER). The CLEAR signal would most often be 
set as a conditional expression in the macro call. An example is when a macro is called 
for the first time. In this case, the values in the registers should be initialized. Thus, the 
programmer would control that through the macro call with a conditional expression such 


as: 
my_macro(times==0, in, &out); 


In this case, a FOR loop increments the variable times, and when times is 0 
(normally during the first execution of the loop), the clear signal is generated and sent to 


the macro. 


The ITER signal is used to ensure that the macro is only processing data once 
during the iteration of the FOR loop. Various stores to memory might cause the loop to 
be “slowed down.” Loop slow down becomes an issue when several memory writes are 
required to the same memory bank. When a loop is slowed down, the ITER signal 
prevents operation of the macro while the FOR loop is processing other lines of code 
within the FOR loop. The VALID signal only remains high during the time that the 


macro is being executed. 


The final signal, ITER, is used when the number of clocks per iteration of a loop 


is greater than one. This allows for providing a pulse that is automatically generated once 
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per iteration. By doing so, an incrementer could be programmed such that it will only 
increment once per loop iteration. This signal, along with the VALID signal, is useful for 


creating enable flip-flops. 


Figure 62 is taken from [15] to provide a graphical representation of these signals 


during execution of a stateful macro. 
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Figure 63 Stateful macro timing diagram, From [15] 
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APPENDIX B. SRC-6 LESSONS LEARNED 


A. MACROS IN A LOOP 


When a “function” call is made to a macro, it looks like any other function call 
made in C. In discussions with Jeff Hammes of SRC Computers, new information was 
learned about the behavior between the C portion of the code and user defined macros. 
In a conventional C program, when a function call is made, the appropriate registers are 
loaded and a jump is executed to the specified portion of memory that contains the 
function code. This is not the same on the SRC-6 when a function call to a macro is 
made. Conceptually, each call to a macro is laid down on the FPGA separately. This 
means that if you have two macro calls, one circuit will be placed on one part of the 
FPGA, while an identical circuit will be placed at a different part. Now, consider the 
case where the C code makes a function call from within a FOR loop. Other looping 
methods can be used to accomplish this. But, for the sake of brevity, only the FOR loop 
will be discussed. In this case, only one circuit is placed on the FPGA. When the C code 
is compiled onto the FPGA, the FOR loop is then translated into a machine which sends 
control signals to the one instance of the macro. This became of interest to this research, 
since the concept of feedback is necessary for the implementation of the genetic 
algorithm. It is because of this property that the value in a register stored at the end of 
iteration of a loop will still be present during the next iteration of the loop. This in effect 
creates a stateful macro, although without the normal control signals associated with a 


stateful macro. 
B. TIMER ACCESS 


The initial idea to provide a random source for ROM address in the macro was to 
use the timer in the subr.mc file. This is not possible since a call to the timer function 
made inside a loop will disable loop pipelining. By doing this, the GA macro cannot be 


made stateful, which the current implementation of the GA requires. Since that is a 
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considerable undertaking, the decision was made to create random element in the main.c 
file, and then allow incrimination along with the CRC function to provide the random 


access to the ROMs. 
Cc. MAKEFILE OPTIONS 


An attempt was made to change the mode of operation of the GA in which the 
latency of the GA was changed to 1, vice the normal 17. In doing so, a new problem 
would be computed each clock cycle until the feedback brings the old values back to the 
input. When the normal version of the circuit was implemented, it was always possible 
to meet the 100.0 MHz timing requirement for the SRC-6. However, changing the 
latency of the circuit to 1 caused the frequency to drop to 90.8 MHz. After consulting the 
Xilinx place and route and mapping documentation, options were found that when used 
sped up the circuit. Table 31 shows the various options that were used and the resulting 


frequency achieved. 























Slowed version of circuit — baseline 100.0 MHz 
Fast 90.8 MHz 
Utilizing MAP E 92.8 MHz 
-timing option 94.0 MHz 
Removing “extra” inputs, extra fanouts 100.0 MHz 


Table 31. | Place and route and mapping options 


Normally the microprocessor portion of the program is contained within a file 
called main.c. However, in order to use some more advanced programming techniques it 
is desired that C++ be used. In order to accomplish this, two things must be changed. 
First, the main.c needs to be renamed to main.cc. Second, in the makefile, the linker 


option needs to be set as follows: LD = icpc. 
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APPENDIX C. AUXILIARY PROGRAMS 


A. INFOER 


In order to help produce this project several other side programs were created in 
C++. The following is a brief description of them. The first is a program called “infoer”’. 
It parses the input Verilog macro and automatically generated the interface files that are 


required for the C FPGA code to be able to call a user defined macro. 
B. CODER 


The next program is entitled “coder”. It parses a user input string to produce 
code. This is helpful in many instances that would normally require repetitive typing. A 
simple example of this would be in the C portion of the FPGA code. Many times 16 
variables need to be passed to and from the Verilog macro. The coder makes it possible 
to type the following line to produce 16 lines of code that would assign a variable to an 


element in an array: 
!10:15 in%d=IN[%d];\n! 
The resultant code that is generated would be: 
inO=IN[O]; , inl=IN[1]; 


and so forth. This has an advantage over the Verilog generate statement. For example, 
compiling the transeunt triangle for n>9 was not possible since it uses generate 
statements, and an out of memory error is eventually reached. However, if the transeunt 
triangle code for n=10 will compile if the generate statements are replaced with the 


actual lines of code that are to be generated. 
C. VERILOG GENERATOR 


The next program that was created is called Verilog Generator. It has been 
expanded from previous projects to include generating the ROMs discussed in this 


project. This is important because it allows creating new ROMs with either different 
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words, or more words. It also contains the code necessary to implement the CRC 
circuitry. Finally, a Verilog standard library was created. This library contains various 
types of flip-flops, e.g. those with an enable, set, reset, and multiplexors that were used. 
This proved useful for the times that it is more practical to use structural Verilog versus 


behavioral. 
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APPENDIX D. GA CODE 


module CRC_table_rom(adr, val); 
//Creates a CRC-32 lookup table 
input [7:0] adr; 
output [31:0] val; 
reg [31:0] val; 
always @(adr) 
case (adr) 
0: val = 32'h00000000; 
1: val = 32'h77073096; 


2: val = 32'hee0e612c; 
3: val = 32'h99095lba; 
4: val = 32'h076dc419; 


val = 32'h706af48f; 
6: val = 32'he963a535; 
7: val = 32'h9e6495a3; 


8: val = 32'h0edb8832; 








9: val = 32'h79dcb8a4; 
LO: val = 32'he0d5e91le; 
11: val = 32'h97d2d988; 
12: val = 32'h09b64c2b; 
13: val = 32'h7ebl7chd; 
14: val = 32'he7b82d07; 
15: val = 32'h90bf1d91; 
16: val = 32'h1ldb71064; 
L7: val = 32'h6ab020£2; 
18: val = 32'h£3b97148; 
19: val = 32'h84be41de; 








20: val = 32'hladad47d; 
21: val = 32'h6ddde4eb; 
22: val = 32'hf4d4b551; 


23: val = 32'h83d385c7; 
24: val = 32'h136c9856; 


25: val = 32'h646ba8c0; 
26: val = 32'hfd62f97a; 
27: val = 32'h8a65c9ec; 
28: val = 32'h14015c4f; 


29: val = 32'h63066cd9; 
30: val = 32'hfa0f3d63; 








31: val = 32'h8d080df5; 
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32: 
32% 
32" 
32;' 
32" 
32 
32' 
32' 
32" 
32" 
32" 
32 
32" 
32:3 
32' 


h5768b525; 
h206f£85b3; 
hb966d409; 
hce61e49f; 
h5edef90e; 
h29d9c998; 
hb0d09822; 
hce7d7a8b4; 
h59b33d17; 
h2eb40d81; 
hb7bd5c3b; 
hcOba6cad; 
hedb88320; 
h9abfb3b6; 


h03b6e20c; 





h74b1d29a; 
head54739; 
h9dd27/af; 
h04db2615; 
h73dc1683; 
he3630b12; 
h94643b84; 
h0d6d6a3e; 
h7a6a5aa8; 
he40ecf0b; 
h9309f£9d; 
hO0a00ae27; 
h7d079eb1; 
hf00f9344; 
h8708a3d2; 
hleQ1f268; 
h6906c2fe; 
h£762575d; 
h806567cb; 
h196c3671; 
h6e6b06e7; 
hfed41b76; 
h89d32be0; 
hl0da7a5a; 
h6é7dd4acc; 


hf9bId£6F; 





h8ebeeff9; 


102 





58: 
59% 
60: 
61: 
62: 
63: 
64: 
65: 
66: 
67: 
68: 
693: 
70: 
71: 
2s 
ihe 
74: 
Lat 
76: 
a is 
78: 
79: 
80: 
81: 
82: 
83: 
84: 
85: 
86: 
87: 
88: 
89: 
90: 
91: 
92°: 
93°: 
94: 
95: 
96: 
97: 
98: 
99: 


val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 


val 


32" 
32: 
32' 
32" 
32:' 
32" 
32s 
32: 
32" 
32" 
32" 
32; 
32: 
32" 
323 
32' 
32" 
32;' 
32" 
32: 
32' 
32" 
32" 
32" 
32:" 
32 
32" 
32: 
32" 
32" 
32:" 
32" 
32s 
32: 
32' 
32" 
32" 
32? 
32: 
32" 
32: 
32' 


hl7b7be43; 


h60b08ed5; 


hd6d6a3e8; 


na 


d1937e; 


h38d8c2c4; 


h4 


hd 


Fdff£252; 


bb67£1; 


ha6bc5767; 


h3 





fb506dd; 


h48b2364b; 


hd80d2bda; 


na 


fFOalb4c; 


h36034af6; 


h4 


hd 


047a60; 
f6é0efc3; 


ha867df£55; 


h3 





6e8eef; 


h4669be79; 


hcb61b38c; 


hbc66831a; 


h256fd2a0; 


h5268e236; 


hec0c7795; 


hbb0b4703; 


h220216b9; 


n5505262f; 


hc5ba3bbe; 


hb2bd0b28; 


h2bb45a92; 


h5cb36a04; 


he2d7ffa7; 


hb5d0cf31; 


h2cd99e8b; 


h5bdeaeld; 


h9b64c2b0; 


hec63f£226; 


h756aa39c; 


h026d930a; 


h9c0906a9; 


heb0e363f; 


h72076785; 





h05005713; 
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WN 


val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 
val 


val 


32" 
32' 
32" 
32" 
32" 
32' 
32" 
32: 
32) 
32" 
32" 
32: 
32: 
32" 
32' 
32" 
32 
32: 
32" 
32s" 
32: 
32" 
32" 
32" 
32:" 
32s 
32" 
32: 
32% 
32" 
32;! 
32" 
32 
32: 
32) 
32: 
32" 
32? 
32 
32" 
323 
32' 


h95 
he2 
h7b 
hOc 
h92 
he5 
h7c 
hOb 
h86 
hf 


hé8 


Fh 


6 





h6of 
h18 
h88 


Hh 


h 
h66 
h 
hst 
hf8 





hé 





h16 
had 
hd7 
h4e 
h39 
ha7 
hdo 
h49 
h3e 
hae 
hdg 
h40 
h37 
had 
hde 
h47 
h30 


hbd 





nca. 


bf4a82; 
b87al4; 
b12bae; 
b61b38; 
d28e9b; 
d5be0d; 
dcefb7; 
dbdf21; 
d3d2d4; 
d4e242; 
ddb3f8; 
da836e; 
bel6cd; 
b9265b; 
bO77el; 
b74777; 
085ae6; 
Of6a70; 
063bca; 
010b5c; 
659eff; 
62ae69; 
6bf£fd3; 
6ccf45; 
O0ae278; 
Odd2ee; 
048354; 
03b3c2; 
672661; 
6016f7; 
69474d; 
6e77db; 
dl6a4a; 
d65adc; 
df0b66; 
d83bf0; 
bcae53; 
bb9ec5; 
b2cf7f; 
b5ffe9; 


bdf21c; 





bac28a; 
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24 


iS) 


val = 32'h53b39330; 
24 


Ww 


val = 32'h24b4a3a6; 
244: val = 32'hbad03605; 
24 


ol 


val = 32'hcdd70693; 
246: val = 32'h54de5729; 
247: val = 32'h23d967bf; 


248: val = 32'hb3667a2e; 





249: val = 32'hc4614ab8; 
250: val = 32'h5d681b02; 
251: val = 32'h2a6f2b94; 
252: val = 32'hb40bbe37; 
253: val = 32'hc30c8eal; 
254: val = 32'h5a05df1lb; 





255: val = 32'h2d02ef8d; 
default: val=32'h0; 
endcase 


endmodule 


module CRC_calc(in, val); 
//module CRC_calc(A, B, C, D, val); 
input [31:0] in; 
ffanput [70] A, By. Gp De 
output [31:0] val; 
wire [31:0] val; 
wire [31:0] regA, regB, regC, regD; 


wire [31:0] topA, topB, topC, topD; 


//CRC_table_rom tA(8'hff*A, topA); 
CRC_table_rom tA(8'hff*in[31:24], topA); 


assign regA= 32'hOOffffff *~ topA; 


//CRC_table_rom tB(regA[7:0] * B, topB); 
CRC_table_rom tB(regA[7:0] * in[23:16], topB); 


assign regB= {8'h00, regA[31:8]}* topB; 


//CRC_table_rom tC(regB[7:0] * C, topC); 
CRC_table_rom tC(regB[7:0] * in[15:8], topC); 


assign regC= {8'h00, regB[31:8]}* topcC; 


//CRC_table_rom tD(regC[7:0] * D, topD); 


CRC_table_rom tD(regC[7:0] * in[7:0], topD); 





assign regD= {8'h00, regC[31:8]}* topD; 
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assign val= regD* 32'hffffffff; 


endmodule 


module OC (TT, Count); 


input [3:0] TT; 


output [2:0] Count; 


wire [2:0] Count; 


assign Count [0]J=TT[3]*TT[2]*TT[1]*TT[0]; 


assign 
Count [1]=(TIT[3]&TIT[2 
J&TT[2]&TT[1]&TT[0]); 








TT [3] &TT[1] |TT[3] &TT[O] |TT[2] &TT[1] | TT[2]&TT[O] | TT[1]&TT[0]) &~(TT[3 


assign Count [2]=TT[3]&TT[2]&TT[1]&TT[0]; 


endmodule 


module ones64(TT, VALID, RESET, ITER, CLK, count, timer, crcv); 


input 


[63:0] TT; 


input VALID, 


input CLK; 


RESET, 


ITER; 


output 
reg [6 
output 


reg 


6:0] 
20] 
63:0] 





31:0] 


count; 


count; 


timer; 


63:0] timer; 


crev; 


count8, 


output 


reg [31:0] 





reg [63:0] 


wire 
reg [4:0] 


wire [2:0] 


count9, count10, 


CRC_calc 


dff_eNB 


Oc 
Oc 
OC o2(TT 
OC o3(TT 
Oc o4(TT 


OC oa ( IT 











OC o6(TT 


ff (wl, 


o0 (ITT[3: 


oem Gillie. 


23 
27 


crev; 


ai; 


[31:0] wl, 


counta 


count 
countl 


COATT TS 


Ol], c 
4], ¢ 
28], 

2), 





6], 
3207 
$24]; 


w2; 


, countb, 


0, 
1, 


1:0], wl) 


VALID&ITER, CLK, 


ount0) ; 

ountl); 

count2); 
count3); 
count4) ; 
count5); 


count6); 


countl, 
count12, 


countc, 


count2, 


, 


w2) 


106 


count13, 


countd; 


count3, 
countl14, 


, 


count4, 
count15; 


count5, 


OC o7(TT[31:28], count7); 
OC 08 (TT[35:32], count 8); 
OC 09 (TT[39:36], count 9); 
OC 010(TT[43:40], countl10); 
OC o11(TT[47:44], countl1l); 
OC 012(TT[51:48], countl12); 
OC) 613(TT [55:52]; countl3) 


OC 014(TT[59:56], countl14); 




















OC 015(TT[63:60], count15); 


always @(posedge CLK) 

begin 
counta <=count0+countl+count2+count3; 
countb <=+count4+count5+count6+count7; 
countc <=count8+count 9+count10+count11; 
countd <=count12+count13+count14+count15; 


count <=counta+countb+countctcountd; 


dl <= TT; 


timer <= dl; 


crcv <= w2; 


end 


endmodule 


module dff_NB(d, CLK, q); 


parameter n=16; 

input [n-1:0] d; 
input CLK; 

output [n-1:0] q; 

reg [n-1:0] q; 

always @(posedge CLK) 


q <= d; 


endmodule 


module oc5(TT, out); 


input [4:0] TT; 
output [2:0] out; 


reg [2:0] out; 


wire [2:0] val; 


oc4 moc4(TT[3:0], val); 
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always @(TT, val) 
out <= val + TT[4]; 


endmodule 


module oc4 (TT, Count); 
input[3:0] TT; 
output [2:0] Count; 
wire [2:0] Count; 
assign Count [0]J=TT[3]*TT[2]*TT[1]*TT[0]; 


assign 
Count [1]=(TT[3] &TT[2] |TT[3] &TT[1] |TT[3] &TT[0] |TT[2] &TT[1] |TT[2] &TT[0] |TT[1] &TT[0]) &~(TT[3 
]&TT[2]&TT[1]&TT[0]); 


assign Count [2]=TT[3]&TT[2]&TT[1]&TT[0]; 


endmodule 


module fit6(TT, CLK, z); 
parameter wid_fit=8; 
parameter wid_TT=14; 
input [wid_TT-1:0] TT; 
input CLK; 
output [wid_fitt+wid_TT-1:0] 2; 


reg [wid_fitt+wid_TT-1:0] z; 


reg [wid_fit-1:0] zd; 


wire [2:0] ones6a, ones6b, ones3, ones2, onesl; 
wire [2:0] ones6ad, ones6bd, ones3d, ones2d, onesld; 
reg [7:0] ones6; 
reg [7:0] val6, val3, val2, vall; 
reg [wid_TT-1:0] rl_TT;//, r2_TT; 
ocS mocéa(TIT[13:9], ones6ad) ; 
oc4 mocé6b(TT[8:5], ones6bd) ; 
oc4 moc3({2'b00, TI[4:3]}, ones3d); 
oc4 moc2({3'b000, TI[2]}, ones2d); 








oc4 mocl({2'bO, TT[1:0]}, onesld); 


defparam f6a.n=3; 

dff_NB f6a(ones6ad, CLK, ones6a); 
defparam f6b.n=3; 

dff_NB f6b(ones6bd, CLK, ones6b) ; 
defparam f3.n=3; 


dff_NB f£3(ones3d, CLK, ones3); 








defparam f2.n=3; 
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always 


begin 


dff_NB f2(ones2d, CLK, 


defparam fl.n=3; 


dff_NB fl(onesld, CLK, 


always @(posedge CLK) 


ones2); 


onesl); 


begin 
r1_TT <= TT; 
z[wid_fit+wid_TT-1:wid_fit] 
z{wid_fit-1:0] <= zd; 

end 

always @(ones6a, ones6b) 


ones6 <= ones6a + ones6b; 


@(ones6, ones3, 
case (ones6) 

<r 

4: 


ones2, onesl) 
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begin 


end 


begin 


<= rl_TT; 


val6 <= 60; 


case (ones3) 
0: val3 <=0; 
1: val3 <=30; 
2: val3 <=60; 
default: val3 

endcase 

case (ones2) 
0: val2 <=0; 
1: val2 <=60; 
default: val2 

endcase 

case (ones1) 
0: vall <=0; 
1: vall <=30; 
2: vall <=60; 
default: vall 

endcase 

val6 <= 80; 

if (ones2==1) 

begin 
val3 <=0; 


< 


< 


< 


0; 


0; 


default: begin 
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end 


else 


begin 


end 


val2 <=80; 

case (ones1) 
0: vall <=0; 
1: vall <=40; 
2: vall <=80; 


default: vall 


endcase 


val3 <=80; 

val2 <=0; 

if (onesl==1) 
vall <=80; 

else 


vall <=0; 


case (ones6) 


endcase 


0: val6é <=0; 
1: val6 <=10; 
2: val6 <=20; 
5: val6 <=30; 
6: val6 <=40; 
7: val6 <=30; 
8: val6 <=20; 


9: val6 <=10; 


case (ones3) 


endcase 


default: val6 <=0; 
0: val3 <=15; 
1: val3 <=60; 
2: val3 <=15; 
default: val3 <=0; 


case (ones2) 


endcase 


0: val2 <=60; 
1: val2 <=15; 


default: val2 <=0; 


end 


endmodule 


module 


case (ones1) 
0: vall <=15; 
1: vall <=60; 
2: vall <=15; 
default: vall <=0; 
endcase 
end 
endcase 
zd = val6+val3+val2+vall; 


dff_cse(d, CLR, SET, EN, CLK, q); 
input d, CLR, SET, EN, CLK; 
output q; 
reg q 
always @(posedge CLR or posedge SET or posedge CLK) 
af (CLR) 
q<=1'b0; 
else if (SET) 
q<=1'bl; 
else if (EN) 
q<=d; 
endmodule 
module LFSR(CLR, SET, EN, CLK, q); 


parameter n=6; 


//parameter taps=4; 


//Parameter n corresponds to the number of bits in the LFSR 


//Each of the tapX parameters directly corresponds to the maximal tap 


//as shown in Table 3.8 of Dixon 


//In 
initialized with the 


//The 
//The 


//SET 


order to properly 
CLR and SET inputs 


use the LFSR, the register must first 


value in SET is the first value stored in the register 


value of CLR must be the NOT of SET 


and CLR must be LOW after initialization for the LFSR to sequence 
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rndll, 
w_rng4, 


parameter tap0=0; 
parameter tapl=0; 
parameter tap2=0; 
parameter tap3=0; 
input [n-1:0] CLR, SET; 
input EN, CLK; 
output [n-1:0] q; 
wire [n-1:0] q; 
wire inwire; 
assign inwire = q[tap0]*q[tapl]*q[tap2]*q[tap3]; 
dff_cse ff0(inwire, CLR[O], SET[0], EN, CLK, q[0]); 
genvar k; 
generate 
for (k=1; k<n; k=k+1) 
begin: ea_ff 
dff_cse ff(q{[k-1], CLR[k], SET[k], EN, CLK, q[k]); 
end 
endgenerate 


endmodule 


module lfsrs(rnd0, rndl, rnd2, rnd3, rnd4, rnd5, rnd6é, rnd7, rnd8, rnd9, rndlo, 
rndi2, rnd13, rndl4, rnd15, CLR, VALID, ITER, CLK, w_rng0, w_rngl, w_rng2, w_rng3, 
w_rng5, w_rng6, w_rng7, w_rng8, w_rng9, w_rngl0, w_rngll, w_rngl2, w_rngl13, 


w_rngl4, w_rngl5); 


rndlo, 


w_rng7, 


//previous is the 14-bit string from the last generation 


//next is the 22-bit string of {14-bit chromosome, 8-bit fitness value} 


parameter n=14; 


parameter AAA=13; 
parameter BBB=4; 
parameter CCC=2; 


parameter DDD=0; 


input [n-1:0] rnd0Q, rndl, rnd2, rnd3, rnd4, rnd5, rnd6é, rnd7, rnd8, rnd9, 
rndl1l, rnd1l2, rnd13, rnd14, rndl15; 


input CLR, VALID, ITER, CLK; 


output [n-1:0] w_rng0, w_rngl, w_rng2, w_rng3, w_rng4, w_rng5, w_rng6, 
w_rng8, w_rng9, w_rngl0, w_rngll, w_rngl2, w_rngl3, w_rngl4, w_rngl15; 





w_rng7, 


wire [n-1:0] w_erng0, w_rngl, w_rng2, w_rng3, w_rng4, w_rng5, w_rng6, 
w_rng8, w_rng9, w_rngl0, w_rngll, w_rngl2, w_rngl3, w_rngl4, w_rngl15; 
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wire [n-1:0] clears [15:0]; 
wire [n-1:0] sets [15:0]; 
assign sets[0]=rnd0; 
assign sets[1]=rndl; 
assign sets[2]=rnd2; 
assign sets[3]=rnd3; 
assign sets[4]=rnd4; 
assign sets[5]=rnd5; 
assign sets[6]=rnd6; 
assign sets[7]=rnd7; 
assign sets[8]=rnd8; 
assign sets[9]=rnd9; 
assign sets[10]=rnd10; 
assign sets =rndll; 
assign sets[12]=rnd12; 
assign sets[13]=rnd13; 
assign sets[14]=rnd14; 
assign sets[15]=rnd15; 
assign clears[0]=~sets[0]; 
assign clears =~sets[1]; 
assign clears[2]=~sets[2]; 
assign clears[3]=~sets[3]; 
assign clears[4]=~sets[4]; 
assign clears[5]=~sets ; 
assign clears[6]=~sets[6]; 
assign clears[7]=~sets[7]; 
assign clears[8]=~sets[8]; 
assign clears[9]=~sets[9]; 
assign clears[10]=~sets[10 
assign clears =~sets[1l 
assign clears[12]=~sets[12 
assign clears[13]=~sets[13 
assign clears[14]=~sets[14 
assign clears[15]=~sets[15 
defparam rng0.n=14; 
defparam rng0.tap0=AAA; 
defparam rng0.tap1=BBB; 
defparam rng0.tap2=CCC; 
defparam rng0.tap3=DDD; 
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w_rng0[13:0]); 


w_rngl1[13:0]); 


w_rng2[13:0]); 


w_rng3[13:0]); 


w_rng4[13:0]); 


w_rng5[13:0]); 


w_rng6[13:0]); 


LFSR rngO (clears[0]&{1l4{CLR}}, 


defparam rngl.n=14; 

defparam rngl.tap0=AAA; 
defparam rngl.tap1=BBB; 
defparam rngl.tap2=CCC; 
defparam rngl.tap3=DDD; 


LFSR rngl(clears[1]&{14{CLR}}, 


defparam rng2.n=14; 

defparam rng2.tap0=AAA; 
defparam rng2.tap1=BBB; 
defparam rng2.tap2=CCC; 
defparam rng2.tap3=DDD; 


LFSR rng2 (clears[2]&{14{CLR}}, 


defparam rng3.n=14; 

defparam rng3.tap0=AAA; 
defparam rng3.tap1=BBB; 
defparam rng3.tap2=CCC; 
defparam rng3.tap3=DDD; 


LFSR rng3 (clears[3]&{1l4{CLR}}, 


defparam rng4.n=14; 
defparam rng4.tap0=AAA; 
defparam rng4.tap1=BBB; 


defparam rng4.tap2=CCC; 





defparam rng4.tap3=DDD; 


LFSR rng4(clears[4]&{14{CLR}}, 


defparam rng5.n=14; 

defparam rng5.tap0=AAA; 
defparam rng5.tap1=BBB; 
defparam rng5.tap2=CCC; 
defparam rng5.tap3=DDD; 


LFSR rng5(clears[5]&{14{CLR}}, 


defparam rng6.n=14; 

defparam rng6.tap0=AAA; 
defparam rng6.tap1=BBB; 
defparam rng6.tap2=CCC; 
defparam rng6.tap3=DDD; 


LFSR rng6(clears[6]&{1l4{CLR}}, 


defparam rng7.n=14; 
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sets[0]&{14{CLR}}, 


sets[1]&{14{CLR}}, 


sets[2]&{14{CLR}}, 


sets[3]&{14{CLR}}, 


sets[4]&{14{CLR}}, 


sets[5]&{14{CLR}}, 


sets[6]&{14{CLR}}, 


VALID&ITER, 


VALID&ITER, 


VALID& ITER, 


VALID&ITER, 


VALID&ITER, 


VALID&ITER, 


VALID& ITER, 


CLK, 


CLK, 


CLK, 


CLK, 


CLK, 


CLK, 


CLK, 


defparam rng7.tap0=AAA; 
defparam rng7/.tap1=BBB; 
defparam rng7.tap2=CCC; 
defparam rng7.tap3=DDD; 


LFSR rng7(clears[7]&{14{CLR}}, sets[7]&{14{CLR}}, VALID& ITER, CLK, 
w_rng7[13:0]); 


defparam rng8.n=14; 

defparam rng8.tap0=AAA; 
defparam rng8.tap1=BBB; 
defparam rng8.tap2=CCC; 
defparam rng8.tap3=DDD; 


LFSR rng8 (clears[8]&{1l4{CLR}}, sets[8]&{l4{CLR}}, VALID&ITER, CLK, 
w_rng8[13:0]); 


defparam rng9.n=14; 

defparam rng9.tap0=AAA; 
defparam rng9.tap1=BBB; 
defparam rng9.tap2=CCC; 
defparam rng9.tap3=DDD; 


LFSR rng9(clears[9]&{1l4{CLR}}, sets[9]&{1l4{CLR}}, VALID&ITER, CLK, 
w_rng9[13:0]); 


defparam rngl0.n=14; 

defparam rngl0.tap0=AAA; 
defparam rngl0.tap1=BBB; 
defparam rngl0.tap2=CCC; 
defparam rngl0.tap3=DDD; 


LFSR rng10(clears[10]&{14{CLR}}, sets[10]&{14{CLR}}, VALID&ITER, CLK, 
w_rngl10[13:0]); 


defparam rngll.n=14; 

defparam rngll.tap0=AAA; 
defparam rngll.tap1=BBB; 
defparam rngll.tap2=CCC; 
defparam rngll.tap3=DDD; 


LFSR rngll(clears[11]&{14{CLR}}, sets[11]&{14{CLR}}, VALID& ITER, CLK, 
w_rngl11[13:0]); 


defparam rng12.n=14; 

defparam rngl2.tap0=AAA; 
defparam rngl2.tap1=BBB; 
defparam rngl2.tap2=CCC; 
defparam rngl2.tap3=DDD; 


LFSR rng12(clears[12]&{14{CLR}}, sets[12]&{14{CLR}}, VALID&ITER, CLK, 
w_rng12[13:0]); 


defparam rng13.n=14; 
defparam rngl3.tap0=AAA; 


defparam rngl3.tap1=BBB; 
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w_rng13[13:0]); 


w_rngl14[13:0]); 


w_rng15[13:0]); 


defparam rngl3.tap2=CCC; 


defparam rngl3.tap3=DDD; 


LFSR 


defparam rngl4.n=14; 


defparam rngl14 
defparam rngl14 
defparam rngl14 
defparam rngl14 


LFSR 


defparam rng15.n=14; 


defparam rngl15 
defparam rng15 


defparam rngl15 








defparam rng 


LFSR 


endmodule 


rng13(clears[13]&{14{CLR}}, 


rng14(clears[14]&{14{CLR}}, 


rng15(clears[15]&{14{CLR}}, 


sets[13]&{14{CLR}}, 


. tapO=AAA; 
.tap1=BBB; 
.tap2=CCC; 


.tap3=DDD; 


sets[14]&{14{CLR}}, 


. CtapO=AAA; 
.tap1=BBB; 
.tap2=CCC; 


5.tap3=DDD; 


sets[15]&{14{CLR}}, 


module slowhalflife(TT, CLR, VALID, ITER, CLK, clear); 
parameter wid_TT=14; 
input [wid_TT-1:0] TT; 
input CLR, VALID, ITER, CLK; 
output clear; 
wire clear; 
reg [wid_TT-1:0] dl, d2, d3; 
reg [2:0] isequal; 
assign clear = &isequal; 


always @(posedge CLK 


begin 

if (CLR) 

begin 
dl <= 
d2 <= 
a3 <= 

end 

else 


or posedge CLR) 


{wid_TT{1'b1}}; 
{wid_TT{1'b0}}; 


{wid_TT{1'b1}}; 
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VALID&ITER, 


VALID& ITER, 


VALID& ITER, 


CLK, 


CLK, 


CLK, 


begin 


if (VALID&ITER) 


begin 
dl <= TT; 
d2 <= dl; 
d3 <= d2; 
end 


end 


end 


always @(TT, dl, d2, d3) 


begin 
isequal[0] <= (TT==d1)?1:0; 
isequal[1] <= (dl==d2) ?1:0; 
isequal[2] <= (d2==d3)?1:0; 
end 
endmodule 


module clearunit(cur0, curl, cur2, cur3, cur4, cur5, cur6, cur7, 
curl0, curll, curl2, curl3, curl4, curl5, CLR, VALID, ITER, CLK, cleared); 


input [13:0] cur0, curl, cur2, cur3, cur4, cur5, cur6, cur7, 
curl0,--curll, curl2; curl3, curl4,. curli5; 


input CLR, VALID, ITER, CLK; 
output [15:0] cleared; 


wire [15:0] cleared; 


wire [15:0] cl; 


reg [2:0] c2; 




















//slowhalflife(TT, CLR, VALID, ITER, CLK, clear); 
slowhalflife st12_0(cur0, CLR, VALID, ITER, CLK, c1[0]); 
slowhalflife st12_1(curl, CLR, VALID, ITER, CLK, cl[1]); 
slowhalflife st12_2(cur2, CLR, VALID, ITER, CLK, cl[2]); 
slowhalflife st12_3(cur3, CLR, VALID, ITER, CLK, cl1[3]); 
slowhalflife st12_4(cur4, CLR, VALID, ITER, CLK, cl1[4]); 
slowhalflife st12_5(cur5, CLR, VALID, ITER, CLK, cl1[5]); 
slowhalflife st12_6(cur6, CLR, VALID, ITER, CLK, cl1[6]); 
slowhalflife st12_7(cur7, CLR, VALID, ITER, CLK, cl[7]); 
st12_8(cur8, CLR, VALID, ITER, CLK, c1[8]); 
st12_9(cur9, CLR, VALID, ITER, CLK, cl1[9]); 
st12_10(curl0, CLR, VALID, ITER, CLK, cl1[10]); 
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cur8, 


cur8, 


cur 9, 


cur 9, 
























































slowhalflife st12_11(curll, CLR, VALID, ITER, CLK, cl[11]); 
slowhalflife st12_12(curl2, CLR, VALID, ITER, CLK, cl[12]); 
slowhalflife st12_13(curl3, CLR, VALID, ITER, CLK, cl[13]); 
slowhalflife st12_14(curl4, CLR, VALID, ITER, CLK, cl1[14]); 
slowhalflife st12_15(curl5, CLR, VALID, ITER, CLK, cl[15]); 
assign cleared[0] = |c2 CLR ello); 

assign cleared[1] = |c2 CLR cl[1]; 

assign cleared[2] = |c2 CLR cl1[2]; 

assign cleared[3] = |c2 CLR e1 [3]; 

assign cleared[4] = |c2 CLR c1[4]; 

assign cleared[5] = |c2 CLR er [5]; 

assign cleared[6] = |c2 CLR cl1[6]; 

assign cleared[7] = |c2 CLR cl[7]; 

assign cleared[8] = |c2 CLR cl1[8]; 

assign cleared[9] = |c2 CLR G1 [-9:] 3 

assign cleared[10] = |c2 CLR c Oj; 

assign cleared = |c2 CLR c Ls 

assign cleared[12] = |c2 CLR c 2% 

assign cleared[13] = |c2 CLR ic 319 

assign cleared[14] = |c2 CLR Cc 4]; 

assign cleared[15] = |c2 CLR G 54 


always @(cur0, curl, cur2, cur3) 


begin 
c2[0] <= (cur0==curl) ?1:0; 
ce2[1] <= (curl==cur2) ?1:0; 
c2[2] <= (cur2==cur3) ?1:0; 
end 
endmodule 


module strgen(rnd0O, rndl, rnd2, rnd3, rnd4, rnd5, rnd6, rnd7, rnd8, rnd9, rndlo, 
rndll, rndl2, rndl3, rndl4, rndl5, prev0O, prevl, prev2, prev3, prev4, prev5, prev6, 
prev7, prev8, prev9, prevl0, prevll, prevl2, prevl3, prevl4, prevl5, reset_val, CLR, 
VALID, ITER, CLK, r_next0O, r_nextl, r_next2, r_next3, r_next4, r_next5, r_next6, r_next7, 
r_next8, r_next9, r_next10,r_nextll, r_next12, r_next13, r_next14, r_nextl15); 


//previous is the 14-bit string from the last generation 
//next is the 22-bit string of {14-bit chromosome, 8-bit fitness value} 
parameter wid_TT=14; 


parameter wid_fit=8; 
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input [13:0] rnd0O, rndl, rnd2, rnd3, rnd4, rnd5, rnd6é, rnd7, rnd8, rnd9, 
rnd10, rndll, rndl2, rnd13, rnd14, rndl15; 

input[13:0] prev0, prevl, prev2, prev3, prev4, prev5, prev6, prev7, prev8, 
prev9, prevl0, prevll, prevl2, prevl3, prevl4, prev15; 

input [7:0] reset_val; 

input CLR, VALID, ITER; 

input CLK /* synthesis syn_noclockbuf=1 syn_maxfan=100000 */ ; 

output [21:0] r_next0, r_nextl, r_next2, r_next3, r_next4, r_next5, 
r_next6, xrinext7, r_next8, r_next9, r_nextl10,r_nextll, r_nextl2, r_next13, r_nextl14, 
r_next15; 

reg [21:0] r_next0O, r_nextl, r_next2, r_next3, r_next4, r_next5, r_next6, 
r_next7, r_next8, r_next9, r_nextl10,r_nextll, r_nextl12, r_next13, r_next14, r_next15; 

a aa aaa a a 

reg [wid_TT+twid_fit-1:0] next0O, nextl, next2, next3, next4, next5, next6, 
next7, next8, next9, nextl10, nextll, next12, next13, nextl14, next15; 

reg [13:0] prev0dl, prevldl, prev2dl, prev3dl, prev4dl, prev5dl, prevédl, 
prev7dl, prev8dl, prev9d1, prevl0dl, prevlldl, prevl2dl, prevl3dl, prevl4dl, prevl5dl; 

wire [wid_TT-1:0] w_rng0, w_rngl, w_rng2, w_rng3, w_rng4, w_rng5, w_rng6, 
w_rng7, w_rng8, w_rng9, w_rngl0, w_rngll, w_rngl12, w_rngl13, w_rngl4, w_rng15; 

wire [wid_TT+wid_fit-1:0] w_fit_ro, w_fit_rl, w_fit_r2, w_fit_r3, 
w_fit_r4, w_fit_r5, w_fit_r6, We tit 1; w_fit_r8, w_fit_r9, w_fit_r10, w_fit_rl1l, 
w_fit_r12, w_fit_r13, w_fit_r14, w_fit_r15; 

wire [wid_TT+wid_fit-1:0] w_fit_p0, w_fit_pl, w_fit_p2, w_fit_p3, 
w_fit_p4, w_fit_p5, w_fit_p6, w_fit_p7, w_fit_p8, w_fit_p9, w_fit_pl10, w_fit_pll, 
w_fit_pl2, w_fit_pl3, w_fit_pl4, w_fit_p15; 

wire [15:0] clearer; 

lfsrs 1l(rnd0O, rndl, rnd2, rnd3, rnd4, rnd5, rnd6é, rnd7, rnd8, rnd9, rndlod, 
rndl1, rndl2, rnd13, rndl4, rndi5, CLR, VALID, ITER, CLK, w_rng0, w_rngl, w_rng2, w_rng3, 
w_rng4, w_rng5, w_rng6, w_rng7, w_rng8, w_rng9, w_rngl0, w_rngll, w_rngl2, w_rngl13, 
w_rngl4, w_rngl5); 

fit6e fit_r0O(w_rng0[13:0 CLK, w_fit_r0); 

fitée fit_rl(w_rng1[13:0 CLK, w_fit_r1l); 

fit6 fit_r2(w_rng2[13:0 CLK, w_fit_r2); 

fit6e fit_r3(w_rng3[13:0 CLK, w_fit_r3); 

fitée fit_r4(w_rng4[13:0 CLK, w_fit_r4); 

fit6e fit_r5(w_rng5[13:0 CLK; “wlfit 25); 














prev8, prev9, 
clearer); 


:0]<reset_val 


:0]<reset_val 


:0)]<reset_val 


:0]<reset_val 








:0]<reset_val 






















































































Fite fit_r6(w_rng6[13:0], CLK, w_fit_r6); 
Fit6é fit_r7(w_rng7[13:0], CLK, w_fit_r7); 
fite fit_r8(w_rng8[13:0], CLK, w_fit_rs); 
Fit6 fit_r9(w_rng9[13:0], CLK, w_fit_r9); 
Fite fit_r10(w_rng10[13:0], CLK, w_fit_r10); 
fit6e fit_rll(w_rng 3:0], CLK, w_fit_rl1l); 
Fite fit_r1l2(w_rng12[13:0], CLK, w_fit_rl12); 
Fit6é fit_r13(w_rng13[13:0], CLK, w_fit_r13); 
fite fit_r14(w_rng14[13:0], CLK, w_fit_rl14); 
Fite fit_r15(w_rng15[13:0], CLK, w_fit_r15); 
fFit6 fit_pO(prev0dl, CLK, w_fit_p0); 
fFit6é fit_pl(prevldl, CLK, w_fit_pl); 
fFit6 fit_p2(prev2dl, CLK, w_fit_p2); 
fit6e fit_p3(prev3d1, CLK, w_fit_p3); 
Fit6 fit_p4(prev4dl, CLK, w_fit_p4); 
Fit6é fit_p5(prev5dl, CLK, w_fit_p5); 
fit6 fit_p6(prev6dl, CLK, w_fit_p6); 
fit6 fit_p7(prev/dl, CLK, w_fit_p7); 
fFit6 fit_p8(prev8dl, CLK, w_fit_p8); 
fit6e fit_p9(prev9d1, CLK, w_fit_p9); 
Fite fit_pl0(prevl0dl1, CLK, w_fit_pl10); 
Fit6é fit_pll(prevlldl, CLK, w_fit_pll); 
fFit6 fit_pl2(prevl2d1, CLK, w_fit_pl2); 
Fite fit_pl3(prev1l3d1, CLK, w_fit_pl13); 
fit6 fit_pl4(prevl4dl1, CLK, w_fit_pl4); 
Fite fit_pl5(prevl5dl1, CLK, w_fit_p15); 
clearunit clru(prev0O, prevl, prev2, prev3, prev4, 
prevl0, prevll, prevl2, prev13, prevl4, prevl5, 
always @(*) 
begin 
next0 wid_TT+wid_fit-1:0]<= ((clearer 
7:0))) ?w_fit_r0O:w_fit_p0; 
nextl wid_TT+wid_fit-1:0]<= ((clearer 
7:0))) ?w_fit_rl:w_fit_pl; 
next2 wid_TT+wid_fit-1:0]<= ((clearer 
7:0))) ?w_fit_r2:w_fit_p2; 
next3 wid_TT+wid_fit-1:0]<= ((clearer 
7:0))) ?w_fit_r3:w_fit_p3; 
next4 wid_TT+wid_fit-1:0]<= ((clearer 
7:0))) ?w_fit_r4:w_fit_p4; 
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CLR, 











prev5, 








prev6, prev7, 
VALID, ITER, CLK, 
(w_fit_p0 [wid_fit-— 
(w_fit_pl[wid_fit-— 
(w_fit_p2 [wid_fit-— 
(w_fit_p3 [wid_fit-— 
(w_fit_p4[wid_fit-— 








<reset_val 


<reset_val 


<reset_val 


<reset_val 


<reset_val 


<reset_val 


<reset_val 


<reset_val 


<reset_val 


<reset_val 


<reset_val 








next5 


?w_fit_r5:w_fit_p5; 


next6 


?w_fit_r6:w_fit_p6; 


next7 


?w_fit_r7:w_fit_p7; 


next8 


?w_fit_r8:w_fit_p8; 


next9 


wid_TT+wid_fit- 


wid_TT+wid_fit- 


wid_TT+wid_fit- 


wid_TT+wid_fit- 














wid_TT+wid_fit- 


?w_fit_r9:w_fit_p9; 


next10 
?w_fit 


wid_TT+wid_fit-l: 
O:w_fit_p10; 





next1l 
?w_fit 


wid_TT+wid_fit-1l: 
1l:w_fit_pll; 





next12 
?w_fit 


wid_TT+wid_fit- 
2:w_fit_pl2; 





next13 
?w_fit 


wid_TT+wid_fit- 
3:w_fit_p13; 





next14 
?w_fit 


wid_TT+wid_fit- 
4:w_fit_p14; 








next15 
?w_fit 


i 








wid_TT+wid_fit- 








Ss wlkit .pi'5; 





@ (posedge CLK) 


prev0d 
previd 
prev2d 
prev3d 
prev4d 
prev5d 
prevoéd 
previd 
prev8d 


prev9d 





prevl0d 


prevlld 


prevl2d 


prevl3d 


prevl4d 








previ5d 


<= prev0; 
<= prevl; 
<= prev2; 
<= prev3; 
<= prev4; 
<= prev5; 
<= prev6; 
<= prev7; 
<= prev8; 
<= prev9; 
<= prevl0; 
<= prevll; 
<= prevl2; 
<= prevl13 


<= prevl4; 





<= prevl15; 


r_next0O <= next0; 


r_nextl <= nextl; 


r_next2 <= next2; 
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0 


0 








<= 


((clearer 


((clearer 


((clearer 


((clearer 


((clearer 


((clearer 


((clearer 


((clearer 


((clearer 


((clearer 








((clearer 














(w_fit_p5 


(w_fit_p6 


(w_fit_p7 





(w_fit_p10 


(w_fit_pl1l 


fit pl2 


(w_fit_p13 


fFit_pl4 











wid_fit- 


wid_fit- 


wid_fit- 


wid_fit- 


wid_fit- 


wid_fit- 


wid_fit- 


wid_fit- 


wid_fit- 


wid_fit- 


wid_fit- 








r_next3 <= next3; 
r_next4 <= next4; 
ronext5 <= next5; 
r_next6 <= next6; 
r_next7 <= next7; 
r_next8 <= next8; 
r_next9 <= next9; 
r_nextl10 <= next10; 
r_nextll <= nextl1l; 
r_next1l2 <= next12; 
r_next13 <= next13; 
r_nextl14 <= next14; 
r_next15 <= next15; 
end 
endmodule 
module cross_unit (a, b, sel, CLK, w, x, y, Z); 
parameter n=14; 
input [n-1:0] a, b, sel; 
input CLK; 
output [n-1:0] w, x, y, 2Z; 
reg [n-1:0] w, x, y, Z; 
wire [n-1:0] c, d; 
defparam cl.n=n; 
erossckt. clla; b,; sel, ¢, d)y 
always @(posedge CLK) 
begin 
w <= a; 
x <= b; 
y <= CGF 
z <= d; 
end 
endmodule 
module bit_swap_B(a, b, ctrl, aprime, bprime); 


//Tf ctrl==1 then swap a and b 


input a, b; 


input clr ly 


122 


output aprime, bprime; 


reg aprime, bprime; 


always @(a, b, ctrl) 


begin 
aprime <= ctrl?b:a; 
bprime <= ctrl?a:b; 
end 
endmodule 


module muxl6tolB(A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, sel, 
parameter n=14; 
input [n-1:0] A, B, C, D, E, F, G, H, I, J, K, L, M, N, 0O, P; 
input [3:0] sel; 
output [n-1:0] Q; 


reg [n-1:0] Q; 


always @(A, B, C, D, E, F, G, H, I, J, K, L, M, N, 0, P, sel) 


case (sel) 





O: Q<=A; 
1: Q<=B; 
2: Q<=C; 
3: Q<=D; 
4: Q<=E; 
Q<=F; 
6: Q<=G; 
7: Q<=H; 
8: Q<=I; 
9: Q<=J; 
0: Q<=K; 
1: Q<=L; 
2: Q<=M; 
3: Q<=N; 
4: Q<=0; 
5: Q<=P; 


default: Q<=A; 
endcase 


endmodule 


module crossckt(a, b, ctrl, aprime, bprime) ; 


parameter n=8; 
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input [n-1:0] a, b, ctrl; 
output [n-1:0] aprime, bprime; 


wire [n-1:0] aprime, bprime; 


genvar k; 
generate 
for (k=0; k<n; k=k+1) 
begin: ea_bit 
bit_swap_B bs(a[k], b[k], ctrl[k], aprime[k], bprime[k]); 
end 
endgenerate 


endmodule 


module sel_cross(crc_adr, CLK, EN, reset, val); 
input [8:0] crc_adr; 
input CLK, EN, reset; 
output [31:0] val; 
wire [31:0] val; 


reg [7:0] adr; 


reg [7:0] sel_adr; 


crossover_mut_rom machine(sel_adr, val); 


always @(crc_adr, adr) 


sel_adr <= crc_adr[0]?crc_adr[8:1]:adr; 


always @(posedge CLK) 


begin 
if (reset) 
adr<=256'd0; 
eLee 
adr<=adr+EN; 
end 
endmodule 


module crossover_mut_rom(adr, val); 
//Creates a ROM to control crossover and mutation 
input [7:0] adr; 
output [31:0] val; 
reg [31:0] val; 


always @(adr) 
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begin 


case 


(adr) 
0: 
Les 


2: 
3 





val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 


val=32' 


b0000 
b0010 
b0100 
b0100 
b0011 
b1100 
b0100 
b0000 
b0000 
b0111 
b0010 
b1100 
b0001 
b0010 
b0110 


b0111 


b0101000100001100100100110 


b0001 
b1011 
b0001 
b1100 
b0000 
b0010 
b0101 
b0111 
b1001 
b0000 
b0000 
b0000 
b1100 
b0100 
b00 
b010 
b11 
b00 
b1000 





b01 


b000 





b010 





b1100 
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000100110010100 


001101000101100 
011000000011000 
011011000101001 


011001000010010 





011100001011010 


111000111001001 


011010000101010 





0 00010010101 


101001000101001 





100101100011010 
011000011010010 


01111011001000 








0011010101101 





001000110101000 





0100100000010 
01000011101000 


101001110010010 





01010010010010 


011010010101100 


001011101010100 





0001101001010 


001011010100000 


01100 100010 





0101010 








011100011000111 


001 








0010 








011001010001111 
001000000100000 
01101100100 


000010111000010 





be 


0000100001011 


011100000110100 


[e 3 


1000001000000 
111110110100110 


000000010111001 





010111100001001 


0011001001110; 
1110001111011; 
1011110001100; 
0001110011011; 
1100100000111; 
1010000110001; 


0100001110000; 





0111100100011; 
0010100110110; 
0101100001000; 





1111000011011; 
0001001011001; 


1010111011110; 





0000000010111; 





00 00010100; 





0100100110000; 


000111; 


0000001100 


oO 
s 


0001110000101; 


000111 








000110 








0010001100101; 
1010000011010; 





0011100110110; 
0001001 
0110011010010; 





0011 














001010100100; 
001000110101; 


111001 














001010110111; 





val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 


val=32' 


601001000000 


b0 
b0 
b0 
b0 
b1 
b0 
b0 
b0 
b0 
b0 
b0 
b0 
b0 


b00100001000010000 


b0 
b0 
b0 
b0 


60000010000 


b0 
b0 
b0 
b0 
b1 
b0 
b1 


b0 


00110 


1110011100010110 


01010 
0010 
1110 
0100 





01100 
0011 





010 


101 
0000 





010 





0010 


0100101000011000 


11100 
11101 
01010 
01100 
11100 





01000 
00001 


00010 


00001 
10001 
00010 
01000 
01110 
10100 
00001 


000000001010100 


00001 
11000 


0000110000001000011001 


100011010111010 


1000 


010 
000 
000 
00 





0100 
000 





0110 


600000101100 


600100001000 


600011001100 


101100 
00001 


010100 
01010 


0 00 
01100 
01100 











010010 





110010 








010100000000110010100110000 
11000001000001100100110110 


0000110 
0111000 


001000 
000011 
110100 
010101 
0011010 
000010 
0010100 
00 0 
010010 





001100 





b0011011010100 00 


D 


11000 


01010 


601100001000 


D 
D 
D 
D 


b0 





e) 


b01011000100101001 





b0 


0000 
1000 
0010 
0010 
0001 





1100 


00000 


126 





01110 
0001 
0000 
000 
0000 
001 








010 


0001000 





0100101 
1011000 
0001010 
010100 
101010 





1001010 
100011 





100110 


111001011101 
010110100110 
010100000000 
001011011101 


001010011111 





100100000111 
100101100000 
010011000010 


100101000110 
010001010010 
001011100000 





101010110010 
1100001011 








01010100110 
1001110100 





00111101100 





010011100100 
00011010100 





00011110011 
000000011000 
00100001110 
010110110010 
0111100101 
00011000110 





1000010111 
011011001100 
00101100100 





1001011110 








111000110 
00001000110 





00000010101 
110010000110 
00000110000 





00010100010 
011011001110 


010000001110 





011100100100 


01; 
10; 
10; 
00; 
00; 


00110000101000100110; 


00; 
00; 
10; 


010001000010000010; 


cL Le 
00; 
10; 
00; 
Ol; 


01; 





11; 
00; 
00; 
10; 
Ol; 
Ol; 
Ol; 
00; 
Ol; 





0; 
00; 











val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 


val=32' 


b10100011000000 
b00100000000101 


b01100001010001 


b11111110010001 


b01000000010101 


b11001001101000 


b010000111011100 





b01000111010100 


b101110010111010 





b011000100000100 


b100101010000000 
b100100000001001 
60101011 


1010101 





b001010110100100 


b101000101110011 


b111001001101000 


b001111010111010 


b001010100100000 


b001101001110000 


b010100000011100 








b011101001111001 


b100000110000000 


b001011001000010 


b001001000111011 





b100101010010010 


b001000001010010 


b001000000110011 





b111000010011001 





b010001100011000 


b000011110001101 


b001101001000010 


6100000010 010 


0011011000101010 
1110010100110100 
100110000100010 
110000011100101 
010000111000100 
000001011000101 


00001000101000 





010001011101000 
00110000010000 
001110000001010 


11010010011001 





10000010111110 


110011110011100 


001111011010000 


001101001000000 


0001100010101110 





00000100100110 





0100100110101011 


0100000100110101 





0100100100100011 


0101010000001110 


1011000101010100 





0101010111001 





0 010000000000 


00000011010100 





0011000011000010 


101001111000110 








001110110101110 


000011001011110 


011000011011000 





1000010100111000 





111000000101100 


0; 
0; 


, 





0; 
0; 
0; 
0; 
0; 
1; 
0; 
0; 
0; 


, 





1; 


0; 


val=32'b10 
val=32'b10100 


val=32'b00110 


val=32'b100000110 





val=32'b00111 
val=32'b00000 


val=32'b10010 


0001 





000 
010 
0010 





000 








100 
00000 


val=32'b00000001100 


val=32'b10000000001 





val=32'b00000001101 


127 


010100100000110 





0110001101000000 


0001010110000000 
0000000101011100 
001010000000000 


100100100001001 





0000101110110101 


0011010000101100 





101000010111010 





0011011101000010 


0100; 





0010; 
01; 
0101; 
001; 














val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 


val=32' 


b010100100100100000000110 
600111001011101001100010 


001000 
010001 


1; 


0; 


b01100011101000010111110000001000; 


b0110001101010010010001 


b000 


b0000010100100 


601010110000 


b0 


BR 


0010100 
61000001101 
60111100000 
600000100000 
60101100001 


61010000100 





60101100101 





b1 





60010000101 


b00100011000001110 


b10111000000 
b0100000000 
b01010010100 
b0000011100 
60110110 
b010 


101000 


010001100 


Ol 


0010 


000011100000010 


100000010011010 





01000000110 
110100100 


1100101010 





0000000101010 


0011010010 





011001010000 





001110000010 


1001000001100 





000 
0010001101000 
0100101 
1010100101011 
1010110 


010101000 0 





10010111010 





010011001 0 





0110010000010 





0111001000010 


101101 
111001 
010011 
001110 
110100 
011100 


010 








011011 
00001 


10000 
001010 


00111 





01110 
011001 


001100 





101110 


000110 


val=32'b10 


val=32'b00 
val=32'b10 
val=32'b0 
val=32'b00 
val=32'bl1 
val=32'b0 
val=32'b00 
val=32'b0 
val=32'b10 


val=32'bl 





val=32'bl 


val=32'b10 
val=32'b00 
val=32'b01 
val=32'b00 
val=32'b00 


val=32'b01 





val=32' 


128 


b01 


000001010000 





01001100100 


010011010000 


10001100 


010010011101 


01010100 








010111000010 





00100000110 





001110000000 
10100001000 
01000011000 
01000101111 
010001001 
0011100011 
0000100000 
0000001011 
01110010010 


1000010000 











1010110110 


10 
001 
00 


000000100010 


ol 


0000010110 





10 
000 
01 
010 
O01 
00 


1010000000101 


10 
000 
00 





00 
010 


010100 
000000010100 





001010110100 





000110 


1001110100110 


00001 


010111101000 





001011101011 





0011011010 


001000000100 


110000100100 





001011010000 
000 
100011000110 


10111010 





010010001100 





01111110 





0011011010000 


0 





111000; 


L 


010010; 


1 


0; 
0; 
0; 
1; 
0; 
1; 
0; 
0; 
0; 


000000011; 


1; 
0; 
0; 
0; 
0; 


10000001; 


0; 
0; 


01001001; 


1; 


01010000; 


1; 


0001; 


0; 


, 





, 


100011001000001010; 





200: 
201: 
202: 
203: 
204: 
205: 
206: 
207: 


66: 
67: 
68: 
69: 
70: 
aly 
123 
738 
74: 
eon. 
76: 
pe 
78: 
79: 
80: 
81: 
82: 
83: 
84: 
85: 
86: 
87: 
88: 
89: 
90: 
Ouse 
92:3 
9333 
94: 
95: 
96: 
OS 
98: 
993 


val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 


val=32' 


b00 
601010000101 
b00 
600000100100 
b00 


00111001 


600110000000 





600100101000 


60000010101 
b0100100000 
60010100100 
60001011101 


600100111000 





60010000101 


611000011100 
600010010000 
600000001001 


600100101000 





b00011 


ray 


100 
b11000 
610010100000 
61010000100 


61101010000 





600010110010 


b110100010 





610001001000 


b000 000 
6000110000 
6000101100 


60011100100 








b000 000 


601110100000 


60001010000 





b1100000001 





600111101100 





b01000101000 
600001011010 
600101010000 
610100100000 
601100001010 
600011010011 


601010010100 





600010010100 


129 


00110000001001000001 


10111010010 


00011001001 
10100011010 


11001010101 





00001101110 


01000110010 
00000110000 


1010000000 





0001101010 
1000000110 
1010010000 
1101000100 
0100010100 


0100101100 





1100111000 





00100100100 


0010000111001000010 


01000001111 
00000011110 
11111001000 
01000001001 
11001010100 


00010010100 





01010000000 
00100011100 


10100100010 





00000011001 


10010000001 
10101001101 
11001000010 
01011010010 


00111001001 


000000000 
000 


001001 


101010001 


01000011100 





001001100 





000100101 


00110001111 








0100101100 


10110111; 


0100110001; 


10010000100001011011001001001,; 


0101101100; 
0100001000; 
1100101110; 


0001100011; 





0110110001; 


1100010101; 





be 


11000101; 
0011011010; 
1001001000; 
0101101010; 
0101000000; 
1100110100; 
0010000101; 
0010000111; 
1110100101; 
100101; 
0100100001; 
01010110; 





0010100101; 





01011110; 
11001000; 








0101000011; 





000110100; 
0010010010; 
0101011010; 
1100011010; 
1010100100; 
1000000010; 
1001101000; 
0100011010; 
010110 0; 
100101011; 
00011 0; 
10110 


01011 ; 








0001001101; 





000100011; 


0101000001; 














101011111; 








val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 
val=32' 


val=32' 


6000101100 
6101001100 
60100000010 
6001110100 
60001011100 





6110101100 
60111010000 


00001011010000 
01100011000000 
1011111000001 
00000000100001 
1010001011010 
01100100010000 
0101000000001 


0001 
00110 
1010 
1110 
1010 
01001 





0111 


b01011011000010010010110000111 





61001000000 





61000001101 
61000110100 
61001010010 
60001010001 
60011010101 
60010000100 


b01101011010100000001010001111000; 


60100000100 
60000011100 
60010011110 
60000100100 
61001000100 
60101101000 


0010100110100 
1001001010000 
00010000010010 











0001011111101001100111; 


1010011010000 
1001000011000 


0110100101011000100011; 


1000110100110 
0001100100111 


00011 
00011 





01010 


0010000000101101100001,; 
101100000111010000101; 


01000 
10001 


01111 


00100 


b01000101000100001010110001111 


61100000100 
61010100101 
60010000110 
61011001000 
60000101110 





60100101100 
60110000000 
60110101000 
b011000000 
6000000110 
6001101011 
6000100111 





60111000110 








b110000110 
60000010100 





60011110110 
61010000000 
61101000001 





60010001101 
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1001111011010 





0001100011101 





1110001001001 
11000010010100 
00110100110100 
0000100111010 
1010100100101 
11000101110010 





1100110101000 





0010000010000 
100010100001 





01011001010000 
010101001110 
011001000110 





101001100010 
01001000011010 
01001100100100 








10010001011010 








01100001110000 


00110 
01010 
01010 
011 
010 
11000 





011 
10000 
0100 
01010 
0000 
0100 
0100 
01100 
1010 
01100 
0110 





0001 





01000 


00; 
00; 
011; 
10; 
000; 
00; 
000; 
01; 


011; 





00; 
011; 


100011101101010000000; 
010100000011110000010; 


101; 


100; 


011; 


011; 





001; 
00; 
000; 








011; 
000; 
000; 
100; 
101; 
101; 
001; 
001; 
001; 




















250: val=32'b00010000101000101110001101111111; 
2018 val=32'b01010011100000001011010001100111; 
252: val=32'b10111000000101010010011001111010; 
253: val=32'b00010010010000001100001101111001; 
254: val=32'b11000011010000001001010101111011; 
255%: val=32'b00110100100000010010110001010111; 


default: val=32'd0; 
endcase 
end 


endmodule 


module dff_cseNB(d, CLR, SET, EN, CLK, q); 
parameter n=32; 
input [n-1:0] d; 
input CLR, SET, EN, CLK; 
output [n-1:0] q; 
reg [n-1:0] q; 
always @(posedge CLR or posedge SET or posedge CLK or posedge EN) 
if (CLR) 
q<={n{1'bO}}; 
else if (SET) 
q<={n{l'bl}}; 
else if (EN) 
qa<=d; 


endmodule 


module dff_eNB(d, EN, CLK, q); 
parameter n=16; 
input [n-1:0] d; 
input EN, CLK; 
output [n-1:0] q; 
reg [n-1:0] q; 
always @(posedge CLK) 
if (EN) 
q<=d; 


endmodule 


module mux2tolB(A, B, sel, Q); 
parameter n=32; 
input [n-1:0] A, B; 
input sel; 


output [n-1:0] Q; 
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reg [n-1:0] Q; 


always @(A, B, sel) 
case (sel) 
O: Q<=A; 
1: Q<=B; 
default: Q<=A; 
endcase 


endmodule 


module dff_ceNB(d, CLR, EN, CLK, q); 
parameter n=16; 
input [n-1:0] d; 
input CLR, EN, CLK; 
output [n-1:0] q; 
reg [n-1:0] q; 


always @(posedge CLK or posedge CLR) 


begin 
if (CLR) 
q <= {16{1'b0}}; 
else if (EN) 
q <= d; 
end 
endmodule 


module MutAdr(inc, CLK, CLR, VALID, ITER, adr); 
parameter n=8; 
input inc, CLK, CLR, VALID, ITER; 
output [7:0] adr; 
wire [7:0] adr; 
wire [7:0] qsum; 


reg [7:0] sum; 


defparam outmux.n=n; 


mux2tolB outmux(sum, {n{1'b0O}}, CLR, adr); 


defparam ff.n=n; 


dff_eNB ff(sum, VALID&ITER, CLK, qsum); 


always @(inc or adr) 
sum <= inc + adr; 


endmodule 
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module MutRom(ADR, 


input 
output 


reg 


//Format: 


element selection. 


always @(ADR) 


begin 


val); 


[7:0] ADR; 


[24:0] val; 


[24:0] val; 


case 


7 bits of O-run length, 


(ADR) 























wid_TT bits of mutator, 





























0: val=25'b0000010000000001000000011,; 
1: val=25'b0000001000000000000001001,; 
28 val=25'b1111111000001001000001000; 
3 val=25'b0011101000000000000000010; 
4: val=25'b0010001000000000000000000; 
val=25'b00 00011000110100100000; 

6 val=25'b0110010010000001100001100; 
7 val=25'bl 1000000000000000001; 
8 val=25'b0011010000000101001001010; 
9 val=25'b0100010000010000000001010; 
0 val=25'b0 01001000000001001110; 

1 val=25'b0000110000000000000011110; 

2 val=25'b1100011000010000000101111; 

3 val=25'bl 000000000000001110; 

4: val=25'b0001110001101000000000001,; 

val=25'b1110011010000100100010001,; 

6: val=25'b0 00010001001000010110; 

des val=25'b0 000010100000001001; 

8: val=25'b1101011100000000001000011,; 

9: val=25'b1110001001000000000001100; 

20 val=25'b0111011000001000000101100; 
21, val=25'bl 000000000000001101; 
22 val=25'b0100001000100000100000101,; 
23 val=25'b0000 000000000010001111; 
24 val=25'b 000000000000000011; 
PA) val=25'b 000000000000001001; 
26 val=25'b10 01000000000000001000; 
27 val=25'b1011000000001000000001010; 
28 val=25'b1l1 11000000000000000111,; 
2 val=25'b1l 11000000000000001010; 
30: val=25'b0001100000000000000001100; 


4 bits of which 





val=25'b0111 
val=25'b1111 
val=25'b0111 


val=25'b1111 


111000101100000000010; 


111000000000000000001; 


101000010000001000010; 


111000000000000000101; 


val=25'b1011000100000000001001000; 


val=25'b0100 


val=25'b0010 


001001000011000010000; 


01000000000000101000; 


val=25'b0100010100101000000101101; 





val=25'b1111 
val=25'b0101 
val=25'b0000 
val=25'b0100 
val=25'b1000 





val=25'b1110 
val=25'b0000 
val=25'b0010 
val=25'b10 
val=25'b01 
val=25'b01 
val=25'b1l 
val=25'b1l 
val=25'b000 
val=25'b00 
val=25'bl 
val=25'b0010 
val=25'b0 
val=25'b 0 
val=25'b10 
val=25'b0100 
val=25'b 
val=25'b 
val=25'b 
val=25'b00 


val=25'b 





val=25'b 
val=25'b0 


val=25'b000 








val=25'bl 


val=25'b0100110010100100000000110; 


val=25'b0100 


val=25'b0100 








ary 








00000000000000001 
01010000010000000 
00000010000000000 


0100000100000000 
100000000000000 


oO 


100000000000000 





100000000000000 





0001000010000000 





000000000000000110; 
00000010010000000101; 
001000000000001000000; 
0000000000000000000; 


OLT; 
111; 


100; 


0000001000000010100; 


001; 
000; 


0000010000000000011,; 


001; 
100; 


110; 


000001001000100000001; 


> 


01100000000000000 


0000000000000000001,; 


101; 


000001000000001010101; 


00000000000000 


00000100000000 


00000000000000 


0101000010001000 
00000000000000 











00000000000000 
00010000010000010 


000000001000000000000; 


11100000000000000 


00101000000000000 
01100100000000000 





000000000000000011; 
000000000000000110; 





111; 


010000000000000000000; 


000; 
011; 





O11; 
101; 
O11; 
O11; 


001; 


011; 





011; 


val=25'b0111001000000100000110100; 
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73: 
74: 
Toe 
76: 
ae 
78: 
79: 
80: 
81: 
82: 
83: 
84: 
85: 
86: 
87: 
88: 
89: 
90: 
91: 
92% 
93: 
94: 
95: 
96: 
Ole 
98: 
99:3 





00: 
Ol: 
02: 
03: 
04: 
05: 
06: 
O7: 
08: 
09: 
10: 
11: 
12:3 
13'3 
14; 


val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 


val=25' 


b0110000000001000100000100; 


b0000 
b1 


b0000 


b010000100100000000000 








D 


b000 





b0110 
b100 





b00 
b000 
b000 


Baal 





b000 


b001000000000000100000 


b0010 
b111 


b010 


b000000001000001000000 


b000 
b1 
b0110 
b010 
b1 


b1 





b1100 











b00 
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0 





oO 


0 
1 
0 


000000000000010 
00000000000000 
0100000001000000 
00000000000000 


00000000000000 
0000000000000000 
0101000000000000 
000000000000000 





0100000000011001 
000000000100100 
000000000000000 
01000000000000000 
000000000000000 
000000000000000 
000000000000001 
000001000001100 
001000010101000 
000000000000000 





00000000000000 
000110000000010 





000000100000000 
0000000000000000 





0001000000100001 


010; 
000; 


011; 





00; 
001; 
Ol; 
001; 
Ol; 





011; 
001; 








111001000000000000010; 


100000100000001100000; 


111000000000000000100; 


011100100100000000010; 


[3 





0 


| 


0000000000100010 
11000000000000000 
0000000010001000 


0100000000000000 
0000000000000000 
000000000000000 
0110000100000011 





100000000000000 
1000000000000000 
1000000000100000 





1000000000000001 


100; 
001; 
000; 
100; 
100; 
011; 
101; 
LOLs 
011; 
011; 


010; 





001; 
110; 








val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 


val=25' 











b1111 
b0011 
b01100 
b 1 
b 1 
b1100 
b00100 
b11010 
b10000 


0 


0 





0 


1000000000000001010; 
1000000000000001001; 
0000000000010000 ; 
1000000000000001011; 
1000000000000000110; 
0000001000010000001,; 





000000010000000 0; 





000000000110001001; 
000000000000001100; 


b1000100100100001000001110; 


b01010 





b00010 
b00010 
b010 





b000 
b0010 
b10100 
b1001 





b0011 


b1001001000000000100000010; 
b0000001000001000000010101; 





0 


0 


0 





0 
0 


000100100001101111; 
000000000000001101; 
000000000000000000; 





00000000000000 





000010000000101010; 





010000000001000011; 
0000000000001000001,; 
0010000000000010 ; 
0000000010000000101,; 
0001000000000001011,; 
0000000000000000101,; 
1000000010010001 ; 











b1110000000100001000001100; 


b0100 
b0000 
b0110 
b1 
b1 
b1 
b01100 
b1 


b00110 








b0 


0 
0 


0000100001000001010; 
001000000000110000; 
000000000000001111; 
000000000000000100; 
000000000000001010; 
000000000000000101; 
000000000000001101; 
000000000000001001; 





000000001000000010; 
0100000000000001010; 


b1001000000001100000001000; 


b00100 


0000000000000001111,; 


b0111101000000001000101100; 


b00101 


0000000000010101000; 


b0001101000000000000001011; 





b10 


ra 











0100001000000000010; 





b0010000010000001000001100; 


136 





aa es 
58: 
SO 
60: 
61: 
62: 
63: 
64: 
65: 
66: 
67: 
68: 
69: 
70: 
par. 
T2s 
Loe 
74: 
Le 
76: 
TTS 
78: 
79: 
80: 
81: 
82: 
83: 
84: 
85: 
86: 
87: 
88: 
89: 
90: 
91: 
92: 
93: 
94: 
95%: 
96: 
Ore 
98: 


val=25'b00 
val=25'b0 
val=25'b0 
val=25'b0 
val=25'b00 
val=25'b0 
val=25'b10 


val=25'b 





val=25'b0 
val=25'b10 
val=25'b10 
val=25'b 
val=25'b0 
val=25'b 
val=25'b 
val=25'b10 
val=25'b0 
val=25'b10 
val=25'b 
val=25'b 


val=25'b00 





val=25'b 


val=25'b00 





val=25'b0 
val=25'b10 
val=25'b00 
val=25'b10 
val=25'b01 
val=25'b00 
val=25'b00 
val=25'bl1l 
val=25'b10 
val=25'b00 
val=25'b01 
val=25'b1l 
val=25'b00 
val=25'b00 
val=25'bl1l 
val=25'b1l 
val=25'b00 


val=25'b00 





val=25'bll 
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101000000001000000 
0101000000000100001 
100100000000000011 
110000000100000000 
001000000000001000 
0000010001001000000 








00100100000010000 
0010000001000001000 
00010000000111000100 
000100000001000000 
0100100000000000000 
00000000000000 





0100001000000000 
000000000000000 





000000000000000 
00100000000100000000 
0 010001000000000 
01100000000100000001 
1 000000000000000 
1 000000000000000 
10001000000000000001 





1 000000000000001 





10 000000000000101 
00001000000000000001 





00101000000001000010 
01100000000010000000 
00100000000000000001 
01100011001010000001 
10011100000100001000 
00100000000100000000 
0011100000000000000 





1011100000000110000 
0100100100010000101 
0000100000000000000 





1111100000000000000 





1010000000001100001 
01100001000001000100 
11111000000000000000 
11111000000000000000 





01001000000000000001 
00010000010000010011 
10110000000001000001 


001; 
011; 
10; 
01; 
001; 
0; 
OL0; 





(> ae > | 


01; 





, 


000; 





Ol; 
00; 
00; 
01; 
001; 
00; 
000; 





010; 
001; 
011; 





001; 


010; 
01; 
010; 





01; 








0; 





val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 
val=25' 


val=25' 


b10001110010010000000011 
b11 10000000000000010 
b01001000000000000000101 
b11 10000000000000001 
b000011100000100000010110 
bO1 00000000000000000 








b00100100000000000100000 








b100110000000000000000110 
b011000100000000000001000 
b001000110000001010000100 
b010000000001100000000 
b0 0110000000000000110 





b1 11100000000000000110 
b01010001000000000001000 
b1110001001000000000000 























b0 011000000000000001 
b1 111000000000000000 
b011010100000100000000000 
b0100111000000000001000 
b00000100000000010000000 
b0001010000000000000001 
b00000100001000001000010 
b 000000000000000 
b 00100000000000000000 
b 000000000000000 
b 0000000000000000 
b 00000000000000010 
b 00000000000000000 
b001100100000000000000100 
b 00000000000000100 
b010110000000000000000001 








b100 00000001010000001 
b011110000000000000000101 
b111110000000000000000010 
b000100000000000001000010 








b110 100000001000000111 





b010101000000000000000101 
b001000100010001000000010 
b001111110000000000000001 
b001100000000010001000100 
b001001000100100000000010 











b100101010000000000000111 
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1; 
0; 
0; 
0; 
i 
i 
H 
H 
i 


0; 





0; 
0; 


0; 
0; 
0; 
0; 
1; 
0; 
1; 
1; 
0; 
0; 
1; 
1; 
1; 
0; 
0; 
0; 

















241: val=25'b1001101000100000000000101,; 
242: val=25'b0011101000000000000001011; 
243 val=25'b0011100000100000000000001,; 
244: val=25'b1111111000000000000001100; 
245: val=25'b1001011000000000000000001,; 
246: val=25'b1111111000000000000000101,; 
247: val=25'b0100000000000000000000100; 
248: val=25'b0000000000001000000001011; 
249: val=25'b0011100000110000001001100; 
250: val=25'b1010010010000000001000111,; 
201: val=25'b1111111000000000000000110; 
252: val=25'b0101111000000000000001001,; 
253: val=25'b0000110010011000000011001,; 
254: val=25'b0100011000000000000101011,; 
29.93 val=25'b1110111000000000000000001,; 














default: val=25'd0; 
endcase 
end 


endmodule 


module incer(val, CLR, EN, CLK, out); 
parameter n=8; 
input val, CLR, EN, CLK; 
output [n-1:0] out; 
reg [n-1:0] out; 
always @(posedge CLK or posedge CLR) 
if (CLR) 
out <= {n{1'b0}}; 
else if (EN) 
out <= out + val; 


endmodule 


module muter(crc_adr, CLR, VALID, ITER, CLK, r_mutout); 


parameter wrom=25; //number of bits in ROM lines, equals {7'Zeros run, 
14'Mutation code, 4'element selection} 


parameter alines=8; //number of bits in ROM address line 


parameter maxrun=7; //number of bits in countre 


input [8:0] crc_adr; 
input CLR, VALID, ITER, CLK; 
output [17:0] r_mutout; 


reg [17:0] r_mutout; 
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wire [17:0] mutout; 


wire [wrom-1:0] romout; 
wire w_isnequal; 


wire [alines-1:0] wq_adr; 





wire [maxrun-1:0] wq_ctr; 


reg r_isequal, d_isequal; 


reg [7:0] rom_adr; 


assign w_isnequal = ~r_isequal; 


defparam inc_ctr.n=maxrun; 


incer inc_ctr(~r_isequal, d_isequal|CLR, VALID&ITER, CLK, 


always @(wq_ctr, romout [wrom-1:wrom-maxrun] ) 


r_isequal <= wq_ctr == romout [wrom-1:wrom-maxrun] ; 


always @(posedge CLK)// or posedge CLR) 
begin 
d_isequal <= r_isequal; 
r_mutout <= mutout; 


end 


defparam inc_adr.n=alines; 


incer inc_adr(d_isequal, CLR, VALID&ITER, CLK, wq_adr)j; 


always @(crc_adr, wq_adr) 


rom_adr <= crc_adr[0] ?crc_adr[8:1]:wq_adr; 


MutRom mr(rom_adr, romout); 


defparam m_mutout.n=wrom-maxrun; 


wq_ctr); 


mux2tolB m_mutout ({ (wrom-maxrun) {1'bO}}, romout [ (wrom-maxrun-1) :0], 
r_isequal, mutout); 


endmodule 


module 


crossmut (w_sort0O, w_sortl, w_sort2, w_sort3, w_sort4, 


w_sorts5, 


w_sort7, w_sort8, w_sort9, w_sortl0, w_sortll, w_sort12, w_sort13, w_sortl14, 
crc_adr, crosscode, CLR, VALID, ITER, CLK, r_mout0, r_moutl, r_mout2, 
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r_mout3, 


w_sorté6, 
w_sortl15, 
r_mout4, 


r_mout5, xr_mout6, r_mout7, r_mout8, r_mout9, r_moutl10, r_moutll, 


r_mout14, r_mout15); 
parameter wid_TT=14; 


parameter wid_fit=8; 


input [21:0] 


w_sort15; 


input [8:0] crc_adr; 


w_sorto, 
w_sort6, w_sort7, w_sort8, w_sort9, w_sortl0, w_sortll, w_sortl12, 


w_sortl, w_sort2, 


input [13:0] crosscode; 


//cre adr has the format of 8:1 adr lines, 


input CLR; 


input VALID, ITER; 


input CLK /* synthesis syn_noclockbuf=1 syn_maxfan=100000 */; 


output [43°04] r_mo 


ut0O, r_moutl, r_mout2, 


r_mout6, r_mout7, r_mout8, r_mout9, r_moutl0, r_mouti1l, 


r_mout15; 


reg [13:0] r_mout0, 
r_mout7, r_mout8, r_mout9, r_moutl10 


wire [13:0] w_mout0, 
w_mout7, w_mout8, w_mout9, w_mout10 


wire [13:0] w_out0O 


r_moutl, r_mout2, r_mout3, 
, Y_moutll, r_mout12, r_mout13, 


w_moutl, w_mout2, w_mout3, 
, w_moutll, w_mout12, w_mout13, 


, Ww_outl, w_out2, w_out3, 
w_outl4, 


w_out7, w_out8, w_out9, w_out10, w_outll, w_out12, w_out13, 





wire [31:0] sel; 


wire [wid_TT-1:0] w_m0, w_ml, w_m2, w_m3, w_m4, 


w_sort3, 


0 control line 


r_mout3, 
r_mout12, 


r_mout4, 


w_mout4, 


w_out4, 


w_m5, 





reg [wid_TT-1:0] r_m 


0, r_ml, r_m2, r_m3, r_m4, 


rims, 





reg w_selmut0O, w_selmutl, w_selmut2, w_selmut3, 


w_selmut6, w_selmut7, 
w_selmut13, w_selmut14, w_selmut15; 


reg [13:0] w_muted0, w_mutedl, w_muted2, w_muted3, 
w_muted12, 


w_selmut8, 


w_selmut9, 


w_selmut10, 


w_muted6, Ww emuted7, w_muted8, w_muted9, w_mutedl0, w_mutedll, 


w_muted14, w_muted15; 


//sel_cross(CLK, EN, 


sel_cross selx(crc_a 


//wire [wid_TT-1:0] 
//assign crosscode = 


//assign crosscode = 


reset, val); 


dr, CLK, VALID&ITER, CLR, 


crosscode; 
14'b11111111100000; 


14'b00000011111111; 


sel); 


w_selmut4, 
w_selmutl1l1, 


r_mout13, 


w_sort5, 
w_sortl14, 


r_mout5, 
r_moutl14, 


r_mout6, 


r_mout15; 


w_mout6, 


w_mout15; 


w_out6, 


w_selmut5, 
w_selmut12, 


w_muted5, 


w_muted13, 


wire [17:0] w_mutout; 


//muter(CLR, VALID, ITER, CLK, r_mutout); 
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muter mtr(crc_adr, CLR, VALID, ITER, CLK, w_mutout); 


defparam m0.n=wid_TT; 


defparam ml.n=wid_TT; 


defparam m2.n=wid_TT; 


defparam m3.n=wid_TT; 


defparam m4.n=wid_TT; 


defparam m5.n=wid_TT; 


defparam m6.n=wid_TT; 











defparam m7.n=wid_TT; 


mux1l6tolB m0 (w_sort0O [wid_TTt+wid_fit-l:wid_fit], 


l:wid_fit], w_sort2 [wid_TT+wid_fit-—l:wid_fit], 
w_sort4 [wid_TT+wid_fit-—l:wid_fit], 
w_sort6 [wid_TT+wid_fit-—l:wid_fit], 
w_sort8 [wid_TT+wid_fit-—l:wid_fit], 


w_sort3 [wid_TT+wid_fit- 
w_sort5 [wid_TT+wid_fit-— 
w_sort7 [wid_TT+wid_fit-— 
w_sort9[wid_TT+wid_fit-— 


w_sort10[wid_TT+wid_fit-l:wid_fit], w_sort11 [wid_TT+wid_fit- 
w_sort12 [wid_TT+wid_fit-—l:wid_fit], w_sort13 [wid_TT+wid_fit- 
w_sort14 [wid_TT+wid_fit-—l:wid_fit], w_sort15 [wid_TT+wid_fit-l:wid_fit], 
w_m0) ; 


mux1l6to1lB ml (w_sort0 [wid_TT+wid_fit-l:wid_fit], 


l:wid_fit], w_sort2 [wid_TT+wid_fit-—l:wid_fit], 
w_sort4 [wid_TT+wid_fit-—l:wid_fit], 
w_sort6[wid_TT+wid_fit-—l:wid_fit], 
w_sort8 [wid_TT+wid_fit-—l:wid_fit], 
w_sort10[wid_TT+wid_fit-l:wid_fit], 


w_sort3 [wid_TT+wid_fit- 
w_sort5 [wid_TT+wid_fit-— 
w_sort7 [wid_TT+wid_fit-— 
w_sort9[wid_TT+wid_fit-— 
w_sort11l [wid_TT+wid_fit-— 


w_sort12 [wid_TT+wid_fit-—l:wid_fit], w_sort13 [wid_TT+wid_fit- 
w_sort14 [wid_TT+wid_fit—-l:wid_fit], w_sort15[wid_TT+wid_fit—-l:wid_fit], 
w_ml1); 


mux1l6to1lB m2 (w_sort0 [wid_TT+wid_fit-l:wid_fit], 


l:wid_fit], w_sort2 [wid_TT+wid_fit-—l:wid_fit], 
w_sort4 [wid_TT+wid_fit-—l:wid_fit], 
w_sort6 [wid_TT+wid_fit-—l:wid_fit], 
w_sort8 [wid_TT+wid_fit-—l:wid_fit], 
w_sort10[wid_TT+wid_fit-—-l:wid_fit], 


w_sort3 [wid_TT+wid_fit-— 
w_sort5 [wid_TT+wid_fit-— 
w_sort7 [wid_TT+wid_fit-— 
w_sort9[wid_TT+wid_fit-— 
w_sort11l[wid_TT+wid_fit-— 


w_sort12 [wid_TT+wid_fit-l:wid_fit], w_sort13 [wid_TT+wid_fit- 
w_sort14 [wid_TT+wid_fit-—l:wid_fit], w_sort15 [wid_TT+wid_fit-—l:wid_fit], 
w_m2); 


l:wid_fit], w_sort2 [wid_TT+wid_fit-—l:wid_fit], 
w_sort4 [wid_TT+wid_fit-—l:wid_fit], 
w_sort6[wid_TT+wid_fit-—l:wid_fit], 
w_sort8 [wid_TT+wid_fit—-l:wid_fit], 


mux1l6to1lB m3 (w_sort0 [wid_TT+wid_fit-l:wid_fit], 


w_sort3 [wid_TT+wid_fit-— 
w_sort5 [wid_TT+wid_fit-— 
w_sort7 [wid_TT+wid_fit-— 
w_sort9[wid_TT+wid_fit-— 


w_sort10[wid_TT+wid_fit-l:wid_fit], w_sort11 [wid_TT+wid_fit- 
w_sort12 [wid_TT+wid_fit-—l:wid_fit], w_sort13 [wid_TT+wid_fit- 
w_sort14 [wid_TT+wid_fit-l:wid_fit], w_sort15 [wid_TT+wid_fit-—l:wid_fit], 
w_m3); 


l:wid_fit], w_sort2 [wid_TT+wid_fit-—l:wid_fit], 
w_sort4 [wid_TT+wid_fit-—l:wid_fit], 
w_sort6[wid_TT+wid_fit-—l:wid_fit], 
w_sort8 [wid_TT+wid_fit-—l:wid_fit], 
w_sort10[wid_TT+wid_fit-—-l:wid_fit], 


mux1l6to1lB m4 (w_sort0 [wid_TT+wid_fit-l:wid_fit], 


w_sort3 [wid_TT+wid_fit-— 
w_sort5 [wid_TT+wid_fit-— 
w_sort7 [wid_TT+wid_fit-— 
w_sort9[wid_TT+wid_fit- 
w_sort11l[wid_TT+wid_fit-— 


w_sort12 [wid_TT+wid_fit-l:wid_fit], w_sort13 [wid_TT+wid_fit- 
w_sort14 [wid_TT+wid_fit-l:wid_fit], w_sort15[wid_TT+wid_fit-—l:wid_fit], 
w_m4); 











l:wid_fit], w_sort2 [wid_TT+wid_fit-l:wid_fit], 
w_sort4 [wid_TT+wid_fit-—l:wid_fit], 
w_sort6[wid_TT+wid_fit-—l:wid_fit], 
w_sort8 [wid_TT+wid_fit-—l:wid_fit], 
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mux1l6to1lB m5 (w_sort0 [wid_TT+wid_fit-l:wid_fit], 


w_sort3 [wid_TT+wid_fit-— 
w_sort5 [wid_TT+wid_fit-— 
w_sort7 [wid_TT+wid_fit-— 
w_sort9[wid_TT+wid_fit-— 


w_sortl [wid_ 


w_sortl [wid_ 


w_sortl[wid_ 


w_sortl [wid_ 


w_sortl [wid_ 


w_sortl [wid_ 


TT+wid_fit- 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
sel[31:28], 


TT+wid_fit- 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
sel[27:24], 





TT+wid_fit- 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
sel[23:20], 


TT+wid_fit- 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
sel[19:16], 


TT+wid_fit- 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 
sel[15:12], 





TT+wid_fit- 
:wid_fit], 
:wid_fit], 
:wid_fit], 
:wid_fit], 








w_sort10[wid_TT+wid_fit—-l:wid_fit], w_sort11[wid_TT+wid_fit-l:wid_fit], 
w_sort12 [wid_TT+wid_fit-l:wid_fit], w_sort13[wid_TT+wid_fit-—l:wid_fit], 
w_sort14[wid_TT+wid_fit-—-l:wid_fit], w_sort15[wid_TT+wid_fit-l:wid_fit], sel[11:8], w_m5); 


mux1l6tolB m6 (w_sort0O [wid_TTt+wid_fit-1l:wid_fit], w_sortl [wid_TT+wid_fit- 











l:wid_fit]l, w_sort2 [wid_TT+wid_fit-l:wid_fit], w_sort3 [wid_TT+wid_fit-l:wid_fit], 
w_sort4 [wid_TT+wid_fit-—l:wid_fit], w_sort5 [wid_TT+wid_fit-—l:wid_fit], 
w_sort6[wid_TT+wid_fit-—l:wid_fit], w_sort7 [wid_TT+wid_fit-—l:wid_fit], 
w_sort8 [wid_TT+wid_fit-—l:wid_fit], w_sort9[wid_TT+wid_fit-—l:wid_fit], 
w_sort10[wid_TT+wid_fit-l:wid_fit], w_sort11 [wid_TT+wid_fit-—-l:wid_fit], 
w_sort12 [wid_TT+wid_fit-l:wid_fit], w_sort13 [wid_TT+wid_fit-—-l:wid_fit], 


w_sort14 [wid_TT+wid_fit-—l:wid_fit], w_sort15[wid_TT+wid_fit-—-l:wid_fit], sel[7:4], w_m6); 


mux1l6tolB m7 (w_sort0O [wid_TT+wid_fit-l:wid_fit], w_sortl [wid_TT+wid_fit- 

















l:wid_fit], w_sort2 [wid_TT+wid_fit-l:wid_fit], w_sort3 [wid_TT+wid_fit-—l:wid_fit], 
w_sort4 [wid_TT+wid_fit-—l:wid_fit], w_sort5 [wid_TT+wid_fit-—l:wid_fit], 
w_sort6 [wid_TT+wid_fit-—l:wid_fit], w_sort7 [wid_TT+wid_fit-—l:wid_fit], 
w_sort8 [wid_TT+wid_fit-—l:wid_fit], w_sort9[wid_TT+wid_fit-—l:wid_fit], 
w_sort10[wid_TT+wid_fit-—l:wid_fit], w_sort11[wid_TT+wid_fit-l:wid_fit], 
w_sort12 [wid_TT+wid_fit-l:wid_fit], w_sort13 [wid_TT+wid_fit-—l:wid_fit], 




















w_sort14[wid_TT+wid_fit-l:wid_fit], w_sort15[wid_TT+wid_fit-l:wid_fit], sel[3:0], w_m7); 


defparam mutmux0.n=wid_TT; 


defparam mutmuxl.n=wid_TT; 


defparam mutmux2.n=wid_TT; 


defparam mutmux3.n=wid_TT; 


defparam mutmux4.n=wid_TT; 


defparam mutmux5.n=wid_TT; 


defparam mutmux6.n=wid_TT; 


defparam mutmux7.n=wid_TT; 


defparam mutmux8.n=wid_TT; 





defparam mutmux9.n=wid_TT; 


defparam mutmux10.n=wid_TT; 


defparam mutmuxll.n=wid_TT; 


defparam mutmux12.n=wid_TT; 
3 


defparam mutmux13.n=wid_TT; 


defparam mutmux14.n=wid_TT; 








defparam mutmux15.n=wid_TT; 








mux2tolB mutmux0 (w_out0, w_muted0, w_selmut0, w_mout0); 
mux2tolB mutmuxl (w_outl, w_mutedl, w_selmutl, w_moutl); 
mux2tolB mutmux2 (w_out2, w_muted2, w_selmut2, w_mout2); 
mux2tolB mutmux3(w_out3, w_muted3, w_selmut3, w_mout3); 
mux2tolB mutmux4 (w_out4, w_muted4, w_selmut4, w_mout4); 
mux2tolB mutmux5 (w_out5, w_muted5, w_selmut5, w_mout5); 
mux2tolB mutmux6 (w_out6, w_muted6, w_selmut6, w_mout6); 
mux2tolB mutmux7 (w_out7, w_muted7, w_selmut7, w_mout7); 
mux2tolB mutmux8 (w_out8, w_muted8, w_selmut8, w_mout8); 
mux2tolB mutmux9 (w_out9, w_muted9, w_selmut9, w_mout9); 
mux2tolB mutmuxl10(w_out10, w_muted10, w_selmut10, w_mout10); 
mux2tolB mutmuxl1l(w_out11, w_mutedll, w_selmutl1l, w_mout11); 
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w_out3); 


w_out7); 


w_outl1l1); 


w_outl15); 


mux2tolB mutmuxl12 (w_out12, w_muted1l2, w_selmut12, 
mux2to1lB mutmux13(w_out13, w_muted13, w_selmut13, 
mux2tolB mutmuxl14 (w_out14, w_mutedl4, w_selmutl14, 


mux2tolB mutmux15(w_out15, w_muted15, w_selmut15, 


defparam cu0.n=wid_TT; 


cross_unit cu0(r_m0, r_ml, crosscode, 


defparam cul.n=wid_ITT; 


cross_unit cul(r_m2, rm3', crosscode, 


defparam cu2.n=wid_TT; 


cross_unit cu2(r_m4, r_m5, crosscode, 


defparam cu3.n=wid_TT; 


cross_unit cu3(r_m6, r_m7, crosscode, 


always @(posedge CLK) 

begin 
r_m0 <= w_m0; 
r_ml <= w_ml; 
r_m2 <= w_m2; 
r_m3 <= w_m3; 
r_m4 <= w_m4; 
rm5 <= w_m5; 
r_m6 <= w_m6; 
r_m7 <= w_m7; 
r_mout0O <= w_mout0; 
r_moutl <= w_moutl1; 
r_mout2 <= w_mout2; 
r_mout3 <= w_mout3; 
r_mout4 <= w_mout4; 
r_mout5 <= w_mout5; 
r_mout6 <= w_mout6; 
r_mout7 <= w_mout7; 
r_mout8 <= w_mout8; 
r_mout9 <= w_mout9; 
r_moutl10 <= w_mout10; 
r_moutlil <= w_mout11; 


r_moutl2 <= w_mout12; 
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CLK, 


CLK, 


CLK, 


CLK, 


w_outo, 


w_out4, 


w_outs, 


w_outl2, 


w_mout12) ; 
w_mout13); 
w_mout14); 


w_mout15); 


w_outl, 


w_out5, 


w_outg9, 


w_outl13, 


w_out2, 


w_out6, 


w_outl10, 


w_outl4, 


r_mout13 <= w_mout13; 
r_moutl14 <= w_mout14; 
r_mout15 <= w_mout15; 


end 


always @(*) 





























begin 
w_selmutO <= (0==w_mutout [3:0 
w_selmutl <= (1==w_mutout [3:0 
w_selmut2 <= (2==w_mutout [3:0 
w_selmut3 <= (3==w_mutout [3:0 
w_selmut4 <= (4==w_mutout [3:0 
w_selmut5 <= (5==w_mutout [3:0 
w_selmut6 <= (6==w_mutout [3:0 
w_selmut7 <= (7==w_mutout [3:0 
w_selmut8 <= (8==w_mutout [3:0 
w_selmut9 <= (9==w_mutout [3:0 
w_selmut10 <= (10==w_mutout[3: 
w_selmutll <= (11==w_mutout[3: 
w_selmut12 <= (12==w_mutout[3: 
w_selmut13 <= (13==w_mutout[3: 
w_selmutl4 <= (14==w_mutout[3: 
w_selmut15 <= (15==w_mutout[3: 
w_muted0 = w_outO * w_mutout 
w_mutedl = w_outl * w_mutout 
w_muted2 = w_out2 * w_mutout 
w_muted3 = w_out3 * w_mutout 
w_muted4 = w_out4 * w_mutout 
w_muted5 = w_out5S “* w_mutout 
w_muted6 = w_lout6 * w_mutout 
w_muted7 = w_out7 * w_mutout 
w_muted8 = w_out8 * w_mutout 
w_muted9 = w_lout9 * w_mutout 
w_muted10 = w_out10 * w_mutout 
w_muted11 = w_outll * w_mutout 
w_muted12 = w_out1l2 * w_mutout 
w_muted13 = w_out13 * w_mutout 
w_muted14 = w_outl4 * w_mutout 
w_muted15 = w_loutl5 * w_mutout 
end 
endmodule 
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module swapperN(a, b, lt, aprime, bprime); 
parameter n=16; 
input [n-1:0] a, b; 
input lt; 
output [n-1:0] aprime, bprime; 
reg [n-1:0] aprime, bprime; 


always @(a, b, lt) 


begin 
ee cay OE | 
begin 
aprime <=b; 
bprime <=a; 
end 
else 
begin 
aprime <=a; 
bprime <=b; 
end 
end 
endmodule 


module dff_NB14(d, CLK, q); 
parameter n=14; 
input [n-1:0] d; 
input CLK; 
output [n-1:0] q; 
reg [n-1:0] q; 
always @(posedge CLK) 

q <= d; 


endmodule 


module dff_NB8(d, CLK, q); 

parameter n=8; 

input [n-1:0] d; 

input CLK; 

output [n-1:0] q; 

reg [n-1:0] q; 

always @(posedge CLK) 
q <= d; 


endmodule 


146 


module dff_N(d, CLK, q); 
parameter n=16; 
input [n<190] d; 
input CLK; 
output [n-1:0] a; 


reg [n-1:0] q; 


always @(posedge CLK) 
q <= d; 


endmodule 


module compare_lt(a, b, 1t); 
parameter n=8; 
input [m=-1:0] a, b; 
output lt; 


reg lt; 


always @(a,b) 
if (a<b) 
lt<=1'bl; 
else 
1t<=1'b0; 


endmodule 


module sort2(a, b, aprime, bprime); 
//Configuration of inputs {8bit index, 8bit fitness value} 
//parameter n=16; 
//parameter k=8; 
parameter wid_TT=14; 
parameter wid_fit=8; 
input [wid_TT+twid_fit-1:0] a, b; 
output [wid_TT+wid_fit-1:0] aprime, bprime; 
wire [wid_TT+wid_fit-1:0] aprime, bprime; 


wire lt; 


compare_lt comp(a[wid_fit-1:0], b[wid_fit-1:0], 1t); 


defparam s.n=wid_TT+wid_fit; 


swapperN s(a, b, 1t, aprime, bprime) ; 


endmodule 


module sort4 (a, 
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aprime, 
bprime, 
cprime, 


dprime) ; 


parameter wid_TT=14; 


parameter wid_fit=8; 


input [wid_TT+wid_fit-1:0] a; 
input [wid_TT+wid_fit-1:0] b; 


input [wid_TT+wid_fit-1:0] c; 
































input CLK; 
input [wid_TT+wid_fit-1:0] d; 

output [wid_TT+wid_fit-1:0] aprime; 
output [wid_TT+wid_fit-1:0] bprime; 
output [wid_TT+wid_fit-1:0] cprime; 
output [wid_TT+wid_fit-1:0] dprime; 
wire [wid_TT+wid_fit-1:0] XLXN_1; 

wire [wid_TT+wid_fit-1:0] XLXN_2; 

wire [wid_TT+wid_fit-1:0] XLXN_3; 

wire [wid_TT+wid_fit-1:0] XLXN_4; 

wire [wid_TT+wid_fit-1:0] XLXN_9; 

wire [wid_TT+wid_fit-1:0] XLXN_10; 
wire [wid_TT+wid_fit-1:0] XLXN_11; 
wire [wid_TT+wid_fit-1:0] XLXN_12; 
wire [wid_TT+wid_fit-1:0] XLXN_13; 
wire [wid_TT+wid_fit-1:0] XLXN_14; 
wire [wid_TT+wid_fit-1:0] XLXN_15; 
wire [wid_TT+wid_fit-1:0] XLXN_16; 
wire [wid_TT+wid_fit-1:0] XLXN_36; 
wire [wid_TT+wid_fit-1:0] XLXN_37; 
wire [wid_TT+wid_fit-1:0] XLXN_40; 
wire [wid_TT+wid_fit-1:0] XLXN_41; 
wire [wid_TT+wid_fit-1:0] XLXN_42; 
wire [wid_TT+wid_fit-1:0] XLXN_43; 








defparam XLXI_1.n=wid_TT+wid_fit; 
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dff_N XLXI_1 (.CLK(CLK), 
-d(XLXN_1 [wid_TT+wid_fit-1:0]), 


-q (XLXN_9[wid_TT+wid_fit-1:0])); 


defparam XLXI_2.wid_TT=wid_TT; 


defparam XLXI_2.wid_fit=wid_fit; 





sort2 XLXI_2 (.a(a[wid_TT+wid_fit-1:0]), 
-b(b[wid_TT+wid_fit-1:0]), 
.aprime (XLXN_1 [wid_TT+wid_fit-1:0]), 


. bprime (XLXN_2 [wid_TT+wid_fit-1:0])); 


defparam XLXI_3.wid_TT=wid_TT; 


defparam XLXI_3.wid_fit=wid_fit; 





sort2 XLXI_3 (.a(c[wid_TT+wid_fit-1:0]), 
-b(d[wid_TT+wid_fit-1:0]), 
.aprime (XLXN_3 [wid_TT+wid_fit-1:0]), 


. bprime (XLXN_4 [wid_TT+wid_fit-1:0])); 





defparam XLXI_5.n=wid_TT+wid_fit; 





dff_N XLXI_5 (.CLK(CLK), 
-d(XLXN_2 [wid_TT+wid_fit-1:0]), 


-Q(XLXN_11[wid_TT+wid_fit-1:0])); 





defparam XLXI_6.n=wid_TT+wid_fit; 





dff_N XLXI_6 (.CLK(CLK), 





-d(XLXN_3 [wid_TT+wid_fit-1:0]), 


-Q(XLXN_10[wid_TT+wid_fit-1:0])); 





defparam XLXI_7.n=wid_TT+wid_fit; 





dff_N XLXI_7 (.CLK(CLK), 
-d(XLXN_4 [wid_TT+wid_fit-1:0]), 


-Q(XLXN_12 [wid_TT+wid_fit-1:0])); 





defparam XLXI_8.wid_TT=wid_TT; 


defparam XLXI_8.wid_fit=wid_fit; 





sort2 XLXI_8 (.a(XLXN_9[wid_TT+wid_fit-1:0]), 


-b (XLXN_10[wid_TT+wid_fit-1:0]), 





.aprime (XLXN_13 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_14 [wid_TT+wid_fit-1:0])); 





defparam XLXI_9.wid_TT=wid_TT; 


defparam XLXI_9.wid_fit=wid_fit; 
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sort2 XLXI_9 (.a(XLXN_11[wid_TTt+wid_fit-1:0]), 





-b (XLXN_12 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_15 [wid_TT+wid_fit-1:0]), 





.bprime (XLXN_16 [wid_TT+wid_fit-1:0])); 








defparam XLXI_10.n=wid_TT+wid_fit; 
dff_N XLXI_10 (.CLK(CLK), 


-d(XLXN_13[wid_TT+wid_fit-1:0]), 





-Q(XLXN_40 [wid_TT+wid_fit-1:0])); 





defparam XLXI_11.n=wid_TT+wid_fit; 
daff_N XLXI_11 (.CLK(CLK), 


-d(XLXN_14[wid_TT+wid_fit-1:0]), 








-q (XLXN_36[wid_TT+wid_fit-1:0])); 





defparam XLXI_12.n=wid_TT+wid_fit; 
agff_N XLXI_12 (.CLK(CLK), 


-O(XLXN_15[wid_TT+wid_fit-1:0]), 





-Q(XLXN_37 [wid_TT+wid_fit-1:0])); 





defparam XLXI_13.n=wid_TT+wid_fit; 
dff_N XLXI_13 (.CLK(CLK), 


-d(XLXN_16[wid_TT+wid_fit-1:0]), 








-Q(XLXN_43 [wid_TT+wid_fit-1:0])); 





defparam XLXI_14.wid_TT=wid_ITT; 





defparam XLXI_14.wid_fit=wid_fit; 





sort2 XLXI_14 (.a(XLXN_36[wid_TT+wid_fit-1:0]), 





.b (XLXN_37[wid_TT+wid_fit-1:0]), 





.aprime (XLXN_41 [wid_TT+wid_fit-1:0]), 





.bprime (XLXN_42 [wid_TT+wid_fit-1:0])); 





defparam XLXI_31.n=wid_TT+wid_fit; 


dff_N XLXI_31 (.CLK(CLK), 





-d(XLXN_40[wid_TT+wid_fit-1:0]), 





-Q(aprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_32.n=wid_TT+wid_fit; 
dff_N XLXI_32 (.CLK(CLK), 


-d(XLXN_41[wid_TT+wid_fit-1:0]), 








-q(bprime [wid_TT+wid_fit-1:0])); 
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defparam XLXI_33.n=wid_TT+wid_fit; 
dff_N XLXI_33 (.CLK(CLK), 


-d(XLXN_42 [wid_TT+wid_fit-1:0]), 





-Q(cprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_34.n=wid_TT+wid_fit; 
dff_N XLXI_34 (.CLK(CLK), 


-d(XLXN_43[wid_TT+wid_fit-1:0]), 





-q(dprime [wid_TT+wid_fit-1:0])); 


endmodule 


module sort8(a, 
b, 
Cr 


CLK, 


aprime, 
bprime, 
cprime, 
dprime, 
eprime, 
fprime, 
gprime, 


hprime) ; 


parameter wid_TT=14; 


parameter wid_fit=8; 


input [wid_TT+wid_fit-1:0] a; 
input [wid_TT+wid_fit-1:0] b; 


input [wid_TT+wid_fit-1:0] c; 

















input CLK; 

input [wid_TT+wid_fit-1:0] d; 

input [wid_TT+wid_fit-1:0] e; 

input [wid_TT+wid_fit-1:0] f; 

input [wid_TT+wid_fit-1:0] g; 

input [wid_TT+twid_fit-1:0 e 
output [wid_TT+wid_fit-1:0] aprime; 


output 
output 
output 
output 
output 
output 


output 


wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 


wire 





wire 





wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 


wid_l 





wid_TT+wid_fit-— 
wid _TT+wid_fit= 
wid_TT+wid_fit-— 
wid_TT+wid_fit- 
wid_TT+wid_fit-— 


wid_TT+wid_fit- 








wid_TT+wid_fit- 


[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit— 
[+wid_fit- 


[+wid_fit- 





[+wid_fit- 
[+wid_fit- 
[T+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[T+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit-— 
[+wid_fit- 
[+wid_fit- 


[+wid_fit- 





[T+wid_fit— 








wid_l 


[+wid_fit- 


oo oo oo oo oo oo oo oe oo oo oo oo oo oo oe oo oo oo oo oo oo oo oe oo oo oe oe . aes = s S 
oO oO ir | oO j=) Le | oO oO oO oO jo) a | j=) oO oO LP | i 3 oO jo) Le 3 oO oO Lo 3 oO Le | j=) oO jo) ie | 
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x x 
x x KX MN KX KH MN NM KH KH NM KM MUMUMlUMUM UMUC 


bprime; 
cprime; 
dprime; 
eprime; 
fprime; 
gprime; 


hprime; 


LXN_ 








wire 
wire 
wire 
wire 
wire 
wire 
wire 


wire 


sort4 XLXI_1 


sort4 XLXI_2 


sort2 XLXI_3 


wid_T 
wid_T 
wid_T 
wid_l 
wid_l 
wid_l 
wid_l 











wid_T 


[+wid_fit- 


t+wid_fit- 
t+wid_fit- 
t+wid_fit- 
t+wid_fit- 
t+wid_fit- 
t+wid_fit- 


t+twid_fit- 








oS. © © © © &. &. .& 


x MN MN MH NM NM KM OM 


LXN_60; 
LXN_61; 
LXN_62; 
LXN_63; 
LXN_64; 
LXN_65; 


LXN_66; 





LXN_67; 


defparam XLXI_1.wid_TT=wid_ITT; 


defparam XLXI_1l.wid_fit=wid_fit; 





(.a(a[wid_TT+wid_fit-1:0]), 


-b(b[wid_TT+wid_fit-1:0]), 


.c(c[wid_TT+wid_fit-1:0]), 


»CLK (CLK), 


-d(d[wid_TT+wid_fit-1:0]), 


.aprime (XLXN. 


11 [wid_TT+wid 





. bprime (XLXN. 


15 [wid_TT+wid 





.cprime (XLXN. 


13 [wid_TTt+wid 








. dprime (XLXN. 





17 [wid_TT+wid 





defparam XLXI_2.wid_TT=wid_TT; 


defparam XLXI_2.wid_fit=wid_fit; 





(.a(e[wid_TT+wid_fit-1:0]), 


b(t [wid _TT+wid fitelso07):, 


,ed@ [wid _TT+wid fit=-1:0)); 


»CLK (CLK), 


-d(h[wid_TT+wid_fit-1:0]), 


.aprime (XLXN. 


12 [wid_TT+wid 





. bprime (XLXN. 


16 [wid_TTt+wid 





.cprime (XLXN. 


14 [wid_TTt+wid 








. dprime (XLXN. 





18 [wid_TTt+wid 





defparam XLXI_3.wid_TT=wid_TT; 


defparam XLXI_3.wid_fit=wid_fit; 





fital: 
fit -1s 
fit 1s 


Exe=-1% 


fit= 1: 
fits1: 
fit -Ls 


Bits 


(.a(XLXN_11[wid_TT+wid_fit-1:0]), 





-b (XLXN_12 [wid_TT+wid_fit-1:0]), 





01), 
01), 
Ol), 
O1)); 
01), 
01), 
01), 
O1)); 


-aprime (XLXN_19[wid_TT+wid_fit-1:0]), 





. bprime (XLXN_20 [wid_TT+wid_fit-1:0])); 





defparam XLXI_4.wid_TT=wid_TT; 


defparam XLXI_4.wid_fit=wid_fit; 
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sort2 XLXI_4 


defparam 
defparam 


sort2 XLXI_5 


defparam 
defparam 


sort2 XLXI_6 


defparam 


dff_N XLXI_7 


defparam 


dff_N XLXI_8 


defparam 


dff_N XLXI_9 


defparam 


dff_N XLXI_10 


defparam 


dff_N XLXI_11 


(.a(XLXN_13[wid_TT+wid_fit-1:0]), 





.b (XLXN_14[wid_TT+wid_fit-1:0]), 





-aprime (XLXN_21 [wid_TT+wid_fit-l: 





.bprime (XLXN_22 [wid_TT+wid_fit-l: 





XLXI_5.wid_TT=wid_TT; 


XLXI_5.wid_fit=wid_fit; 





(.a(XLXN_15 [wid_TT+wid_fit-1:0]), 





.b (XLXN_16[wid_TT+wid_fit-1:0]), 





-aprime (XLXN_23 [wid_TT+wid_fit-l: 








.bprime (XLXN_24 [wid_TT+wid_fit-1l: 


XLXI_6.wid_TT=wid_TT; 


XLXI_6.wid_fit=wid_fit; 





(.a(XLXN_17[wid_TT+wid_fit-1:0]), 





-b (XLXN_18 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_25 [wid_TT+wid_fit-l: 





.bprime (XLXN_26 [wid_TT+wid_fit-l: 





XLXI_7.n=wid_TT+wid_fit; 





(.CLK (CLK) , 


-d(XLXN_19[wid_TT+wid_fit-1:0]), 





-q (XLXN_32 [wid_TT+wid_fit-1:0])); 





XLXI_8.n=wid_TT+wid_fit; 





(.CLK (CLK) , 


-d(XLXN_23 [wid_TT+wid_fit-1:0]), 








-q (XLXN_33 [wid_TT+wid_fit-1:0])); 


XLXI_9.n=wid_TT+wid_fit; 





(.CLK (CLK) , 


-d(XLXN_21[wid_TT+wid_fit-1:0]), 





-q(XLXN_28 [wid_TT+wid_fit-1:0])); 





XLXI_10.n=wid_TT+wid_fit; 
(.CLK (CLK), 


-d(XLXN_25[wid_TT+wid_fit-1:0]), 





-q (XLXN_30[wid_TT+wid_fit-1:0])); 





XLXI_11.n=wid_TT+wid_fit; 
(.CLK (CLK) , 


-d(XLXN_20[wid_TT+wid_fit-1:0]), 
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01), 
O1)); 
01), 
O1)); 
01), 
O1)); 


-q(XLXN_29[wid_TT+wid_fit-1:0])); 





defparam XLXI_12.n=wid_TT+wid_fit; 
dadff_N XLXI_12 (.CLK(CLK), 


-d(XLXN_24[wid_TT+wid_fit-1:0]), 





-Q(XLXN_31[wid_TT+wid_fit-1:0])); 





defparam XLXI_13.n=wid_TT+wid_fit; 
dff_N XLXI_13 (.CLK(CLK), 


-d(XLXN_22 [wid_TT+wid_fit-1:0]), 





-q (XLXN_38 [wid_TT+wid_fit-1:0])); 





defparam XLXI_14.n=wid_TT+wid_fit; 
aff_N XLXI_14 (.CLK(CLK), 


-d(XLXN_26[wid_TT+wid_fit-1:0]), 








-q (XLXN_39[wid_TT+wid_fit-1:0])); 


defparam XLXI_23.wid_TT=wid_ITT; 





defparam XLXI_23.wid_fit=wid_fit; 





sort2 XLXI_23 (.a(XLXN_28 [wid_TT+wid_fit-1:0]), 





-b (XLXN_29 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_34 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_35 [wid_TT+wid_fit-1:0])); 





defparam XLXI_24.wid_TT=wid_ITT; 





defparam XLXI_24.wid_fit=wid_fit; 





sort2 XLXI_24 (.a(XLXN_30[wid_TTt+wid_fit-1:0]), 





-b (XLXN_31[wid_TT+wid_fit-1:0]), 





.aprime (XLXN_36 [wid_TT+wid_fit-1:0]), 





.bprime (XLXN_37 [wid_TT+wid_fit-1:0])); 





defparam XLXI_25.n=wid_TT+wid_fit; 
dff_N XLXI_25 (.CLK(CLK), 


-d(XLXN_32 [wid_TT+wid_fit-1:0]), 





-q(XLXN_60[wid_TT+wid_fit-1:0])); 





defparam XLXI_26.n=wid_TT+wid_fit; 
dff_N XLXI_26 (.CLK(CLK), 


-d(XLXN_33 [wid_TT+wid_fit-1:0]), 





-Q(XLXN_46 [wid_TT+wid_fit-1:0])); 





defparam XLXI_27.n=wid_TT+wid_fit; 
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dff_N XLXI_27 (.CLK(CLK), 


-d(XLXN_34[wid_TT+wid_fit-1:0]), 





-q(XLXN_47[wid_TT+wid_fit-1:0])); 





defparam XLXI_29.n=wid_TT+wid_fit; 
dff_N XLXI_29 (.CLK(CLK), 


-d(XLXN_36 [wid_TT+wid_fit-1:0]), 





-Q(XLXN_48 [wid_TT+wid_fit-1:0])); 





defparam XLXI_30.n=wid_TT+wid_fit; 
dff_N XLXI_30 (.CLK(CLK), 


-d(XLXN_35 [wid_TT+wid_fit-1:0]), 





-Q(XLXN_49 [wid_TT+wid_fit-1:0])); 





defparam XLXI_31.n=wid_TT+wid_fit; 
dff_N XLXI_31 (.CLK(CLK), 


-d(XLXN_37[wid_TT+wid_fit-1:0]), 





-Q(XLXN_50 [wid_TT+wid_fit-1:0])); 





defparam XLXI_32.n=wid_TT+wid_fit; 
dff_N XLXI_32 (.CLK(CLK), 


-d(XLXN_38 [wid_TT+wid_fit-1:0]), 





-q(XLXN_51[wid_TT+wid_fit-1:0])); 





defparam XLXI_33.n=wid_TT+wid_fit; 
dff_N XLXI_33 (.CLK(CLK), 


-d(XLXN_39[wid_TT+wid_fit-1:0]), 








-Q(XLXN_67 [wid_TT+wid_fit-1:0])); 


defparam XLXI_34.wid_TT=wid_ITT; 





defparam XLXI_34.wid_fit=wid_fit; 





sort2 XLXI_34 (.a(XLXN_46[wid_TT+wid_fit-1:0]), 





-b (XLXN_47[wid_TT+wid_fit-1:0]), 





.aprime (XLXN_61 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_62 [wid_TT+wid_fit-1:0])); 





defparam XLXI_35.wid_TT=wid_ITT; 





defparam XLXI_35.wid_fit=wid_fit; 





sort2 XLXI_35 (.a(XLXN_48 [wid_TT+wid_fit-1:0]), 





.b (XLXN_49[wid_TT+wid_fit-1:0]), 





.aprime (XLXN_63 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_64 [wid_TT+wid_fit-—1:0])); 
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defparam XLXI_36.wid_TT=wid_TT; 





defparam XLXI_36.wid_fit=wid_fit; 





sort2 XLXI_36 (.a(XLXN_50[wid_TT+wid_fit-1:0]), 





-b (XLXN_51[wid_TT+wid_fit-1:0]), 





.aprime (XLXN_65 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_66 [wid_TT+wid_fit-1:0])); 





defparam XLXI_45.n=wid_TT+wid_fit; 
dff_N XLXI_45 (.CLK(CLK), 


-d(XLXN_60 [wid_TT+wid_fit-1:0]), 





-q(aprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_46.n=wid_TT+wid_fit; 
dff_N XLXI_46 (.CLK(CLK), 


-d(XLXN_61[wid_TT+wid_fit-1:0]), 





-q(bprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_47.n=wid_TT+wid_fit; 
adff_N XLXI_47 (.CLK(CLK), 


.d(XLXN_62 [wid_TT+wid_fit-1:0]), 





-q(cprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_48.n=wid_TT+wid_fit; 
dff_N XLXI_48 (.CLK(CLK), 


-d(XLXN_63 [wid_TT+wid_fit-1:0]), 





-q(dprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_49.n=wid_TT+wid_fit; 
dff_N XLXI_49 (.CLK(CLK), 


-d(XLXN_64[wid_TT+wid_fit-1:0]), 





-q(eprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_50.n=wid_TT+wid_fit; 
dff_N XLXI_50 (.CLK(CLK), 


-d(XLXN_65 [wid_TT+wid_fit-1:0]), 





-q(fprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_51.n=wid_TT+wid_fit; 
dff_N XLXI_51 (.CLK(CLK), 


-d(XLXN_66 [wid_TT+wid_fit-1:0]), 





-q(gprime [wid_TT+wid_fit-1:0])); 
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defparam XLXI_52.n=wid_TT+wid_fit; 
dff_N XLXI_52 (.CLK(CLK), 


-d(XLXN_67[wid_TT+wid_fit-1:0]), 





-q(hprime [wid_TT+wid_fit-1:0])); 


endmodule 


module sort(a, b, c, d, e, f, g, h, i, Jj, k, 1, m, n, 0, p,CLK, aprime, bprime, 
cprime, dprime, eprime, fprime, gprime, hprime, iprime, jprime, kprime, lprime, mprime, 
nprime, oprime, pprime); 


parameter wid_TT=14; 


parameter wid_fit=8; 


input [wid_TT+wid_fit-1:0] a; 
input [wid_TT+wid_fit-1:0] b; 
input [wid_TT+wid_fit-1:0] c; 
//input CLK; 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 
input [wid_TT+wid_fit- 


input [wid_TT+wid_fit- Oo; 


Ler | ite 3 oO ©. oO jo) oO i 3 oO oO oO oO oO 
us 
s 


input [wid_TT+wid_fit- Pp; 




















input CLK /* synthesis syn_noclockbuf=1 syn_maxfan=100000 */; 
output [wid_TT+wid_fit-1:0] aprime; 
output [wid_TT+wid_fit-1:0] bprime; 
output [wid_TT+wid_fit-1:0] cprime; 
output [wid_TT+wid_fit-1:0] dprime; 
output [wid_TT+wid_fit-1:0] eprime; 
output [wid_TT+wid_fit-1:0] fprime; 
output [wid_TT+wid_fit-1:0] gprime; 
output [wid_TT+wid_fit-1:0] hprime; 
output [wid_TT+wid_fit-1:0] iprime; 
output [wid_TT+wid_fit-1:0] jprime; 
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output 
output 
output 
output 
output 


output 


wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 


wire 





wire 





wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 
wid_T1 


wid_l 





wid_TT+wid_fit-— 
wid _TT+wid_fit= 
wid_TT+wid_fit-— 
wid_TT+wid_fit-— 


wid_TT+wid_fit- 











wid_TT+wid_fit- 


[+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 


[T+wid_fit- 





[+wid_fit- 
[+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[T+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit-— 
[+wid_fit- 
[+wid_fit- 


[+wid_fit- 





[T+wid_fit— 





wid_l 





[+wid_fit- 
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x MM 
a sa a 2 


kprime; 
lprime; 
mprime; 
nprime; 
oprime; 


pprime; 


LXN_17; 
LXN_31; 
LXN_49; 
LXN_50; 
LXN_51; 
LXN_52; 
LXN_53; 
LXN_54; 
LXN_55; 
LXN_56; 
LXN_57; 
LXN_58; 
LXN_59; 
LXN_60; 
LXN_61; 
LXN_62; 
LXN_63; 
LXN_64; 
LXN_65; 
LXN_66; 
LXN_67; 
LXN_68; 
LXN_69; 
LXN_70; 
LXN_71; 
LXN_72; 
LXN_73; 
LXN_74; 
LXN_75; 
LXN_76; 
LXN_77; 
LXN_78; 
LXN_278; 


LXN_424; 





LXN_452; 
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wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 
wire 


wire 





wid_l 
wid_l 
wid_l 
wid_l 
wid_t 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_T 
wid_l 
wid_l 
wid_l 
wid_1 
wid_T 
wid_l 
wid_1 
wid_l 
wid_l 
wid_1 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 
wid_l 


wid_l 





wid_l 


[+wid_fit-— 
[+wid_fit- 
[+wid_fit- 
[T+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 


[T+wid_fit- 





[+wid_fit- 
[+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[T+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit— 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit- 
[+wid_fit-— 
[+wid_fit- 
[+wid_fit- 


[+wid_fit- 





[T+wid_fit— 





[+wid_fit- 
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x =~ KM KM NM OM 
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LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 
LXN_4 


LXN_4 








LXN_4 


wire [wid_TT+wid_fit- LXN_495; 


wire [wid_TT+wid_fit- LXN_496; 


wire [wid_TT+wid_fit- LXN_497; 


wire [wid_TT+wid_fit- LXN_498; 


wire [wid_TT+wid_fit- LXN_499; 


wire [wid_TT+wid_fit- LXN_500; 


wire [wid_TT+wid_fit- LXN_501; 


wire [wid_TT+wid_fit- LXN_502; 


wire [wid_TT+wid_fit- LXN_503; 


wire [wid_TT+wid_fit- LXN_515; 


wire [wid_TT+wid_fit- LXN_517; 


wire [wid_TT+wid_fit- LXN_518; 
wire [wid_TT+wid_fit- LXN_519; 
wire [wid_TT+wid_fit- LXN_520; 
wire [wid_TT+wid_fit- LXN_521; 
wire [wid_TT+wid_fit- LXN_522; 


wire [wid_TT+wid_fit- LXN_524; 





wire [wid_TT+wid_fit- LXN_525; 


wire [wid_TT+wid_fit- 


wire [wid_TT+wid_fit-— LXN_527; 


wire [wid_TT+wid_fit- LXN_528; 


wire [wid_TT+wid_fit- LXN_529; 


wire [wid_TT+wid_fit- LXN_530; 


wire [wid_TT+wid_fit- LXN_531; 


wire [wid_TT+wid_fit- LXN_532; 


wire [wid_TT+wid_fit- LXN_533; 


wire [wid_TT+wid_fit- LXN_534; 


wire [wid_TT+wid_fit- LXN_535; 


wire [wid_TT+wid_fit-— LXN_536; 
wire [wid_TT+wid_fit-— LXN_537; 
wire [wid_TT+wid_fit-— LXN_538; 
wire [wid_TT+wid_fit— LXN_539; 
wire [wid_TT+wid_fit- LXN_540; 
wire [wid_TT+wid_fit-— LXN_541; 
wire [wid_TT+wid_fit— LXN_542; 


wire [wid_TT+wid_fit- LXN_543; 
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x 
Xx 
xX 
x 
x 
Xx 
xX 
Xx 
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Xx 
xX 
Xx 
Xx 
xX 
Xx 
xX 
Xx 
x 
XLXN_526; 
Xx 
Xx 
Xx 
Xx 
Xx 
Xx 
Xx 
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X 
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X 
X 
X 
X 
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X 
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wire [wid_TT+wid_fit- LXN_544; 


defparam XLXI_1.wid_TT=wid_TT; 





defparam XLXI_1l.wid_fit=wid_fit; 
sort8 XLXI_1 (.a(a[wid_TT+wid_fit-1:0]), 


-b(b[wid_TT+wid_fit-1:0]), 
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-c(c[wid_TTtwid_fit-1:0]), 
-CLK (CLK), 
.d(d[wid_TT+wid_fit- 


.e(e[wid_TT+wid_fit- 


-g(g[wid_TT+wid_fit- 











0 

0 
-£(£[wid_TT+wid_fit-1:0]), 

0 

0 


-h(h[wid_TT+wid_fit- 


.aprime (XLXN_17 [wid_TT+wid_fit-— 





.bprime (XLXN_51 [wid_TT+wid_fit-— 





.cprime (XLXN_55 [wid_TT+wid_fit-— 








-dprime (XLXN_59 [wid_TT+wid_fit-— 


.eprime (XLXN_63 [wid_TT+wid_fit-— 





. fprime (XLXN_67 [wid_TT+wid_fit-— 





-gprime (XLXN_71 [wid_TT+wid_fit— 




















-hprime (XLXN_75 [wid_TT+wid_fit-— 





defparam XLXI_2.wid_TT=wid_TT; 


defparam XLXI_2.wid_fit=wid_fit; 





sort8 XLXI_2 (.a(i[lwid_TT+wid_fit-1:0]), 
.b(j[wid_TT+wid_fit-1:0]), 
.c(k[wid_TT+wid_fit-1:0]), 
»CLK (CLK), 
-d(1l[wid_TT+wid_fit- 


.e(m[wid_TT+wid_fit- 


-g(o[wid_TT+wid_fit- 











0 

0 
-£(n[wid_TT+wid_fit-1:0]), 

0 

0 


-h(p[wid_TT+wid_fit- 





-aprime (XLXN_31 [wid_TT+wid_fit-— 


.bprime (XLXN_52 [wid_TT+wid_fit-— 





.cprime (XLXN_56 [wid_TT+wid_fit-— 





. dprime (XLXN_60 [wid_TT+wid_fit-— 





-eprime (XLXN_64 [wid_TT+wid_fit-— 





. fprime (XLXN_68 [wid_TT+wid_fit-— 





-gprime (XLXN_72 [wid_TT+wid_fit— 




















-hprime (XLXN_76 [wid_TT+wid_fit-— 





defparam XLXI_3.wid_TT=wid_TT; 


defparam XLXI_3.wid_fit=wid_fit; 





sort2 XLXI_3 (.a(XLXN_17[wid_TT+wid_fit-1:0]), 





.b (XLXN_31[wid_TT+wid_fit-1:0]), 





-aprime (XLXN_49 [wid_TT+wid_fit-1l: 





.bprime (XLXN_50 [wid_TT+wid_fit-l: 
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defparam XLXI_4.wid_TT=wid_TT; 


defparam XLXI_4.wid_fit=wid_fit; 








sort2 XLXI_4 (.a(XLXN_51[wid_TTt+wid_fit-1:0]), 


-b (XLXN_52 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_53 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_54 [wid_TT+wid_fit-1:0])); 





defparam XLXI_5.wid_TT=wid_TT; 


defparam XLXI_5.wid_fit=wid_fit; 





sort2 XLXI_5 (.a(XLXN_55 [wid_TT+wid_fit-1:0]), 





.b(XLXN_56 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_57 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_58 [wid_TT+wid_fit-1:0])); 





defparam XLXI_6.wid_TT=wid_ITT; 


defparam XLXI_6.wid_fit=wid_fit; 





sort2 XLXI_6 (.a(XLXN_59[wid_TTt+wid_fit-1:0]), 





-b (XLXN_60 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_61 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_62 [wid_TT+wid_fit-1:0])); 





defparam XLXI_7.wid_TT=wid_TT; 


defparam XLXI_7.wid_fit=wid_fit; 





sort2 XLXI_7 (.a(XLXN_63 [wid_TT+wid_fit-1:0]), 





-b (XLXN_64[wid_TT+wid_fit-1:0]), 





.aprime (XLXN_65 [wid_TT+wid_fit-1:0]), 








.bprime (XLXN_66 [wid_TT+wid_fit-1:0])); 


defparam XLXI_8.wid_TT=wid_TT; 


defparam XLXI_8.wid_fit=wid_fit; 





sort2 XLXI_8 (.a(XLXN_67 [wid_TT+wid_fit-1:0]), 





.b(XLXN_68 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_69[wid_TT+wid_fit-1:0]), 





. bprime (XLXN_70 [wid_TT+wid_fit-1:0])); 





defparam XLXI_9.wid_TT=wid_TT; 


defparam XLXI_9.wid_fit=wid_fit; 





sort2 XLXI_9 (.a(XLXN_71[wid_TT+wid_fit-1:0]), 





-b (XLXN_72 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_73 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_74 [wid_TT+wid_fit-1:0])); 
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defparam XLXI_10.wid_TT=wid_ITT; 





defparam XLXI_10.wid_fit=wid_fit; 





sort2 XLXI_10 (.a(XLXN_75[wid_TT+wid_fit-1:0]), 





-b (XLXN_76[wid_TT+wid_fit-1:0]), 





.aprime (XLXN_77 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_78 [wid_TT+wid_fit-1:0])); 





defparam XLXI_27.n=wid_TT+wid_fit; 
agff_N XLXI_27 (.CLK(CLK), 


-d(XLXN_49[wid_TT+wid_fit-1:0]), 





-q (XLXN_452 [wid_TT+wid_fit-1:0])); 





defparam XLXI_28.n=wid_TT+wid_fit; 
dff_N XLXI_28 (.CLK(CLK), 


-d(XLXN_53 [wid_TT+wid_fit-1:0]), 





-q (XLXN_453 [wid_TT+wid_fit-1:0])); 





defparam XLXI_29.n=wid_TT+wid_fit; 
dff_N XLXI_29 (.CLK(CLK), 


-d(XLXN_57[wid_TT+wid_fit-1:0]), 





-q (XLXN_454 [wid_TT+wid_fit-1:0])); 





defparam XLXI_30.n=wid_TT+wid_fit; 
dff_N XLXI_30 (.CLK(CLK), 


-d(XLXN_61[wid_TT+wid_fit-1:0]), 





-q (XLXN_455 [wid_TT+wid_fit-1:0])); 





defparam XLXI_31.n=wid_TT+wid_fit; 
dff_N XLXI_31 (.CLK(CLK), 


-d(XLXN_65 [wid_TT+wid_fit-1:0]), 





-q (XLXN_460 [wid_TT+wid_fit-1:0])); 





defparam XLXI_32.n=wid_TT+wid_fit; 
dff_N XLXI_32 (.CLK(CLK), 


-d(XLXN_69[wid_TT+wid_fit-1:0]), 





-q (XLXN_464 [wid_TT+wid_fit-1:0])); 





defparam XLXI_33.n=wid_TT+wid_fit; 
dff_N XLXI_33 (.CLK(CLK), 


-d(XLXN_73[wid_TT+wid_fit-1:0]), 





-q (XLXN_468 [wid_TT+wid_fit-1:0])); 
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defparam XLXI_34.n=wid_TT+wid_fit; 
dff_N XLXI_34 (.CLK(CLK), 


-dO(XLXN_77[wid_TT+wid_fit-1:0]), 





-q (XLXN_472 [wid_TT+wid_fit-1:0])); 





defparam XLXI_35.n=wid_TT+wid_fit; 
dff_N XLXI_35 (.CLK(CLK), 


-d(XLXN_50 [wid_TT+wid_fit-1:0]), 





-q (XLXN_462 [wid_TT+wid_fit-1:0])); 





defparam XLXI_36.n=wid_TT+wid_fit; 
dff_N XLXI_36 (.CLK(CLK), 


-d(XLXN_54[wid_TT+wid_fit-1:0]), 





-q (XLXN_466 [wid_TT+wid_fit-1:0])); 





defparam XLXI_37.n=wid_TT+wid_fit; 
dff_N XLXI_37 (.CLK(CLK), 


-d(XLXN_58 [wid_TT+wid_fit-1:0]), 





-q (XLXN_470 [wid_TT+wid_fit-1:0])); 





defparam XLXI_38.n=wid_TT+wid_fit; 
dff_N XLXI_38 (.CLK(CLK), 


-d(XLXN_62 [wid_TT+wid_fit-1:0]), 





-q (XLXN_473 [wid_TT+wid_fit-1:0])); 





defparam XLXI_43.n=wid_TT+wid_fit; 
dff_N XLXI_43 (.CLK(CLK), 


-d(XLXN_66 [wid_TT+wid_fit-1:0]), 





-q (XLXN_459 [wid_TT+wid_fit-1:0])); 





defparam XLXI_44.n=wid_TT+wid_fit; 
adff_N XLXI_44 (.CLK(CLK), 


-d(XLXN_70[wid_TT+wid_fit-1:0]), 





-q (XLXN_458 [wid_TT+wid_fit-1:0])); 





defparam XLXI_45.n=wid_TT+wid_fit; 
dff_N XLXI_45 (.CLK(CLK), 


-d(XLXN_74[wid_TT+wid_fit-1:0]), 








-q (XLXN_457 [wid_TT+wid_fit-1:0])); 


defparam XLXI_46.n=wid_TT+wid_fit; 
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dff_N XLXI_46 (.CLK(CLK), 


-d(XLXN_78 [wid_TT+wid_fit-1:0]), 





-q (XLXN_456 [wid_TT+wid_fit-1:0])); 





defparam XLXI_55.n=wid_TT+wid_fit; 
dff_N XLXI_55 (.CLK(CLK), 


-d(XLXN_452 [wid_TT+wid_fit-1:0]), 





-q (XLXN_476 [wid_TT+wid_fit-1:0])); 





defparam XLXI_56.n=wid_TT+wid_fit; 
dff_N XLXI_56 (.CLK(CLK), 


-d(XLXN_453 [wid_TT+wid_fit-1:0]), 





-q (XLXN_477 [wid_TT+wid_fit-1:0])); 





defparam XLXI_57.n=wid_TT+wid_fit; 
dff_N XLXI_57 (.CLK(CLK), 


-d(XLXN_454 [wid_TT+wid_fit-1:0]), 





-q (XLXN_480 [wid_TT+wid_fit-1:0])); 





defparam XLXI_58.n=wid_TT+wid_fit; 
dff_N XLXI_58 (.CLK(CLK), 


-d(XLXN_455 [wid_TT+wid_fit-1:0]), 





-q (XLXN_492 [wid_TT+wid_fit-1:0])); 





defparam XLXI_59.n=wid_TT+wid_fit; 
dff_N XLXI_59 (.CLK(CLK), 


-d(XLXN_461 [wid_TT+wid_fit-1:0]), 








-q (XLXN_482 [wid_TT+wid_fit-1:0])); 


defparam XLXI_60.n=wid_TT+wid_fit; 
dff_N XLXI_60 (.CLK(CLK), 


-d(XLXN_465 [wid_TT+wid_fit-1:0]), 





-q (XLXN_493 [wid_TT+wid_fit-1:0])); 





defparam XLXI_61.n=wid_TT+wid_fit; 
dff_N XLXI_61 (.CLK(CLK), 


-d(XLXN_469 [wid_TT+wid_fit-1:0]), 





-q (XLXN_496 [wid_TT+wid_fit-1:0])); 





defparam XLXI_62.n=wid_TT+wid_fit; 
dff_N XLXI_62 (.CLK(CLK), 


-d(XLXN_475 [wid_TT+wid_fit-1:0]), 


166 





-Q(XLXN_502 [wid_TT+wid_fit-1l: 





defparam XLXI_63.n=wid_TT+wid_fit; 
dff_N XLXI_63 (.CLK(CLK), 


-d(XLXN_463 [wid_TT+wid_fit-1: 





-Q(XLXN_497 [wid_TT+wid_fit-1l: 





defparam XLX1I_64.n=wid_TT+wid_fit; 
dff_N XLXI_64 (.CLK(CLK), 


-d(XLXN_467 [wid_TT+wid_fit-1: 








-Q(XLXN_503 [wid_TT+wid_fit-1l: 


defparam XLXI_65.n=wid_TT+wid_fit; 
dff_N XLXI_65 (.CLK(CLK), 


-d(XLXN_471 [wid_TT+wid_fit-1: 





-Q(XLXN_491 [wid_TT+wid_fit-1l: 





defparam XLXI_66.n=wid_TT+wid_fit; 
dff_N XLXI_66 (.CLK(CLK), 


-d(XLXN_474 [wid_TT+wid_fit-1: 





-Q(XLXN_487 [wid_TT+wid_fit-1l: 





defparam XLXI_67.n=wid_TT+wid_fit; 
dff_N XLXI_67 (.CLK(CLK), 


-d(XLXN_459 [wid_TT+wid_fit-1: 





-Q(XLXN_488 [wid_TT+wid_fit-1l: 





defparam XLXI_68.n=wid_TT+wid_fit; 
dff_N XLXI_68 (.CLK(CLK), 


-d(XLXN_458 [wid_TT+wid_fit-1: 





-Q(XLXN_484 [wid_TT+wid_fit-1l: 





defparam XLXI_69.n=wid_TT+wid_fit; 
dff_N XLXI_69 (.CLK(CLK), 


-d(XLXN_457 [wid_TT+wid_fit-1: 





-Q(XLXN_478 [wid_TT+wid_fit-1l: 





defparam XLXI_70.n=wid_TT+wid_fit; 
dff_N XLXI_70 (.CLK(CLK), 


-d(XLXN_456 [wid_TT+wid_fit-1: 





-Q(XLXN_479 [wid_TT+wid_fit-1l: 
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defparam XLXI_145.n=wid_TT+wid_fit; 


dff_N XLXI_145 (.CLK(CLK), 


-d(XLXN_476 [wid_TT+wid_fit-1: 





-Q(XLXN_278 [wid_TT+wid_fit-1l: 





defparam XLXI_146.n=wid_TT+wid_fit; 


dff_N XLXI_146 (.CLK(CLK), 


-d(XLXN_477 [wid_TT+wid_fit-1: 





-q(XLXN_515 [wid_TT+wid_fit-1: 





defparam XLXI_147.n=wid_TT+wid_fit; 


dff_N XLXI_147 (.CLK(CLK), 


-d(XLXN_481 [wid_TT+wid_fit-1: 





-q (XLXN_517 [wid_TT+wid_fit-1: 





defparam XLX1I_148.n=wid_TT+wid_fit; 


dff_N XLXI_148 (.CLK(CLK), 


-d(XLXN_494 [wid_TT+wid_fit-1: 





-Q(XLXN_520 [wid_TT+wid_fit-1l: 





defparam XLXI_149.n=wid_TT+wid_fit; 


dff_N XLXI_149 (.CLK(CLK), 


-d(XLXN_483 [wid_TT+wid_fit-1: 





-Q(XLXN_521 [wid_TT+wid_fit-1l: 





defparam XLXI_150.n=wid_TT+wid_fit; 


dff_N XLXI_150 (.CLK(CLK), 


-d(XLXN_498 [wid_TT+wid_fit-1: 





-Q(XLXN_525 [wid_TT+wid_fit-1l: 





defparam XLXI_151.n=wid_TT+wid_fit; 


dff_N XLXI_151 (.CLK(CLK), 


-d(XLXN_495 [wid_TT+wid_fit-1: 





-Q(XLXN_526 [wid_TTt+wid_fit-1l: 





defparam XLXI_152.n=wid_TT+wid_fit; 


dff_N XLXI_152 (.CLK(CLK), 


-d(XLXN_500 [wid_TT+wid_fit-1: 





-Q(XLXN_529[wid_TT+wid_fit-1l: 





defparam XLXI_153.n=wid_TT+wid_fit; 


dff_N XLXI_153 (.CLK(CLK), 
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-d(XLXN_499 [wid_TT+wid_fit-1:0]), 





-q (XLXN_530 [wid_TT+wid_fit-1:0])); 





defparam XLXI_154.n=wid_TT+wid_fit; 
dff_N XLXI_154 (.CLK(CLK), 


-d(XLXN_501[wid_TT+wid_fit-1:0]), 





-q (XLXN_533 [wid_TT+wid_fit-1:0])); 





defparam XLXI_155.n=wid_TT+wid_fit; 
dff_N XLXI_155 (.CLK(CLK), 


-d(XLXN_490 [wid_TT+wid_fit-1:0]), 





-q (XLXN_534 [wid_TT+wid_fit-1:0])); 





defparam XLXI_156.n=wid_TT+wid_fit; 
dff_N XLXI_156 (.CLK(CLK), 


-d(XLXN_486 [wid_TT+wid_fit-1:0]), 





-q (XLXN_537 [wid_TT+wid_fit-1:0])); 





defparam XLXI_157.n=wid_TT+wid_fit; 
dff_N XLXI_157 (.CLK(CLK), 


-d(XLXN_489 [wid_TT+wid_fit-1:0]), 





-q (XLXN_538 [wid_TT+wid_fit-1:0])); 





defparam XLXI_158.n=wid_TT+wid_fit; 
dff_N XLXI_158 (.CLK(CLK), 


-d(XLXN_485 [wid_TT+wid_fit-1:0]), 





-q (XLXN_544 [wid_TT+wid_fit-1:0])); 





defparam XLXI_159.n=wid_TT+wid_fit; 
dff_N XLXI_159 (.CLK(CLK), 


-d(XLXN_478 [wid_TT+wid_fit-1:0]), 





-q (XLXN_543 [wid_TT+wid_fit-1:0])); 





defparam XLXI_160.n=wid_TT+wid_fit; 
dff_N XLXI_160 (.CLK(CLK), 


-d(XLXN_479 [wid_TT+wid_fit-1:0]), 





-q (XLXN_424 [wid_TT+wid_fit-1:0])); 





defparam XLXI_191.n=wid_TT+wid_fit; 
dff_N XLXI_191 (.CLK(CLK), 


-d(XLXN_278 [wid_TT+wid_fit-1:0]), 





-q(aprime [wid_TT+wid_fit-1:0])); 


169 


defparam XLXI_192.n=wid_TT+wid_fit; 
dff_N XLXI_192 (.CLK(CLK), 


-d(XLXN_519 [wid_TT+wid_fit-1:0]), 





-q(bprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_193.n=wid_TT+wid_fit; 
dff_N XLXI_193 (.CLK(CLK), 


-d(XLXN_518 [wid_TT+wid_fit-1:0]), 





-Q(cprime[wid_TT+wid_fit-1:0])); 


defparam XLXI_194.n=wid_TT+wid_fit; 
dff_N XLXI_194 (.CLK(CLK), 


-d(XLXN_524 [wid_TT+wid_fit-1:0]), 





-q(dprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_195.n=wid_TT+wid_fit; 
dff_N XLXI_195 (.CLK(CLK), 


-d(XLXN_522 [wid_TT+wid_fit-1:0]), 





-q(eprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_196.n=wid_TT+wid_fit; 
dff_N XLXI_196 (.CLK(CLK), 


-d(XLXN_527 [wid_TT+wid_fit-1:0]), 





-q(fprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_197.n=wid_TT+wid_fit; 
dff_N XLXI_197 (.CLK(CLK), 


-d(XLXN_528 [wid_TT+wid_fit-1:0]), 





-Q(gprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_198.n=wid_TT+wid_fit; 
dff_N XLXI_198 (.CLK(CLK), 


-d(XLXN_531[wid_TT+wid_fit-1:0]), 





-q(hprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_199.n=wid_TT+wid_fit; 
dff_N XLXI_199 (.CLK(CLK), 


-d(XLXN_532 [wid_TT+wid_fit-1:0]), 





-q(iprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_200.n=wid_TT+wid_fit; 
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dff_N XLXI_200 (.CLK(CLK), 


-d(XLXN_535 [wid_TT+wid_fit-1:0]), 





-Q(jprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_201.n=wid_TT+wid_fit; 
dff_N XLXI_201 (.CLK(CLK), 


-d(XLXN_536 [wid_TT+wid_fit-1:0]), 





-q(kprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_202.n=wid_TT+wid_fit; 
dff_N XLXI_202 (.CLK(CLK), 


-d(XLXN_539 [wid_TT+wid_fit-1:0]), 





-q(lprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_203.n=wid_TT+wid_fit; 
dff_N XLXI_203 (.CLK(CLK), 


-d(XLXN_540 [wid_TT+wid_fit-1:0]), 





-q(mprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_204.n=wid_TT+wid_fit; 
dff_N XLXI_204 (.CLK(CLK), 


-d(XLXN_541[wid_TT+wid_fit-1:0]), 





-q(nprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_205.n=wid_TT+wid_fit; 
dff_N XLXI_205 (.CLK(CLK), 


-d(XLXN_542 [wid_TT+wid_fit-1:0]), 





-q(oprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_206.n=wid_TT+wid_fit; 
dff_N XLXI_206 (.CLK(CLK), 


-d(XLXN_424 [wid_TT+wid_fit-1:0]), 





-q(pprime [wid_TT+wid_fit-1:0])); 


defparam XLXI_282.wid_TT=wid_TT; 





defparam XLX1I_282.wid_fit=wid_fit; 





sort2 XLXI_282 (.a(XLXN_460[wid_TT+wid_fit-1:0]), 





-b (XLXN_462 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_461[wid_TT+wid_fit-1:0]), 








. bprime (XLXN_463 [wid_TT+wid_fit-1:0])); 


defparam XLXI_283.wid_TT=wid_TT; 
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defparam XLX1I_283.wid_fit=wid_fit; 





sort2 XLXI_283 (.a(XLXN_464 [wid_TT+wid_fit-1:0]), 





-b (XLXN_466 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_465[wid_TT+wid_fit-1:0]), 





- bprime (XLXN_467[wid_TT+wid_fit-1:0])); 








defparam XLXI_284.wid_TT=wid_TT; 


defparam XLX1I_284.wid_fit=wid_fit; 





sort2 XLXI_284 (.a(XLXN_468 [wid_TT+wid_fit-1:0]), 





-b (XLXN_470 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_469[wid_TT+wid_fit-1:0]), 








. bprime (XLXN_471[wid_TT+wid_fit-1:0])); 


defparam XLXI_285.wid_TT=wid_TT; 





defparam XLX1I_285.wid_fit=wid_fit; 





sort2 XLXI_285 (.a(XLXN_472 [wid_TT+wid_fit-1:0]), 





-b (XLXN_473 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_475[wid_TT+wid_fit-1:0]), 





- bprime (XLXN_474 [wid_TT+wid_fit-1:0])); 





defparam XLXI_286.wid_TT=wid_TT; 





defparam XLXI_286.wid_fit=wid_fit; 





sort2 XLXI_286 (.a(XLXN_480[wid_TT+wid_fit-1:0]), 





-b (XLXN_482 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_481[wid_TT+wid_fit-1:0]), 





. bprime (XLXN_483 [wid_TT+wid_fit-1:0])); 





defparam XLX1I_287.wid_TT=wid_TT; 





defparam XLX1I_287.wid_fit=wid_fit; 





sort2 XLXI_287 (.a(XLXN_492 [wid_TT+wid_fit-1:0]), 





-b (XLXN_493 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_494[wid_TT+wid_fit-1:0]), 





. bprime (XLXN_495 [wid_TT+wid_fit-1:0])); 





defparam XLXI_288.wid_TT=wid_TT; 








defparam XLX1I_288.wid_fit=wid_fit; 


sort2 XLXI_288 (.a(XLXN_496[wid_TT+wid_fit-1:0]), 





-b (XLXN_497 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_498 [wid_TT+wid_fit-1:0]), 





. bprime (XLXN_499 [wid_TT+wid_fit-1:0])); 





defparam XLXI_289.wid_TT=wid_TT; 
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defparam XLXI_289.wid_fit=wid_fit; 





sort2 XLXI_289 (.a(XLXN_502 [wid_TT+wid_fit-1:0]), 





.-b(XLXN_503 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_500[wid_TT+wid_fit-1:0]), 





- bprime (XLXN_501[wid_TT+wid_fit-1:0])); 








defparam XLXI_290.wid_TT=wid_TT; 


defparam XLX1I_290.wid_fit=wid_fit; 





sort2 XLXI_290 (.a(XLXN_491 [wid_TT+wid_fit-1:0]), 





-b (XLXN_488 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_490[wid_TT+wid_fit-1:0]), 








.bprime (XLXN_489 [wid_TT+wid_fit-1:0])); 


defparam XLXI_291.wid_TT=wid_TT; 





defparam XLXI_291.wid_fit=wid_fit; 





sort2 XLXI_291 (.a(XLXN_487 [wid_TT+wid_fit-1:0]), 





.b (XLXN_484 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_486[wid_TT+wid_fit-1:0]), 





. bprime (XLXN_485 [wid_TT+wid_fit-1:0])); 





defparam XLX1I_298.wid_TT=wid_TT; 





defparam XLX1I_298.wid_fit=wid_fit; 





sort2 XLXI_298 (.a(XLXN_515[wid_TTt+wid_fit-1:0]), 





-b (XLXN_517 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_519[wid_TT+wid_fit-1:0]), 





.bprime (XLXN_518 [wid_TT+wid_fit-1:0])); 





defparam XLXI_299.wid_TT=wid_TT; 





defparam XLXI_299.wid_fit=wid_fit; 





sort2 XLXI_299 (.a(XLXN_520[wid_TTt+wid_fit-1:0]), 





-b (XLXN_521 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_524[wid_TT+wid_fit-1:0]), 





. bprime (XLXN_522 [wid_TT+wid_fit-1:0])); 





defparam XLXI_300.wid_TT=wid_TT; 








defparam XLXI_300.wid_fit=wid_fit; 


sort2 XLXI_300 (.a(XLXN_525 [wid_TT+wid_fit-1:0]), 





-b (XLXN_526 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_527[wid_TT+wid_fit-1:0]), 





. bprime (XLXN_528 [wid_TT+wid_fit-1:0])); 





defparam XLXI_301.wid_TT=wid_TT; 
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rndll, 
w_outo, 


defparam XLXI_301.wid_fit=wid_fit; 





sort2 XLXI_301 (.a(XLXN_529[wid_TT+wid_fit-1:0]), 





.b (XLXN_530 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_531[wid_TT+wid_fit-1:0]), 





- bprime (XLXN_532 [wid_TT+wid_fit-1:0])); 





defparam XLXI_302.wid_TT=wid_TT; 





defparam XLXI_302.wid_fit=wid_fit; 





sort2 XLXI_302 (.a(XLXN_533[wid_TTt+wid_fit-1:0]), 





-b (XLXN_534 [wid_TT+wid_fit-1:0]), 





-aprime (XLXN_535[wid_TT+wid_fit-1:0]), 





.bprime (XLXN_536[wid_TT+wid_fit-1:0])); 





defparam XLXI_303.wid_TT=wid_TT; 





defparam XLXI_303.wid_fit=wid_fit; 





sort2 XLXI_303 (.a(XLXN_537 [wid_TTt+wid_fit-1:0]), 





-b(XLXN_538 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_539[wid_TT+wid_fit-1:0]), 





- bprime (XLXN_540 [wid_TT+wid_fit-1:0])); 








defparam XLXI_304.wid_TT=wid_TT; 


defparam XLXI_304.wid_fit=wid_fit; 





sort2 XLXI_304 (.a(XLXN_544 [wid_TTt+wid_fit-1:0]), 





.-b (XLXN_543 [wid_TT+wid_fit-1:0]), 





.aprime (XLXN_541[wid_TT+wid_fit-1:0]), 








- bprime (XLXN_542 [wid_TT+wid_fit-1:0])); 


endmodule 


module ga(rnd0O, rndl, rnd2, rnd3, rnd4, rnd5, rnd6, rnd7, rnd8, rnd9, rndlod, 
rndl2, rnd13, rnd1l4, rnd1l5, min_fit, crc_sigs, crosscode, CLR, VALID, ITER, CLK, 
w_outl, w_out2, w_out3, w_out4, w_out5, w_out6, w_out7, w_out8, w_out9, w_outl10, 


w_outl1, w_out12, w_out13, w_out14, w_outl15); 


rndlo, 


parameter wid_TT=14; 


parameter wid_fit=8; 


input [13:0] rnd0, rndl, rnd2, rnd3, rnd4, rnd5, rnd6é, rnd7, rnd8, rnd9, 
rndll, rnd1l2, rnd13, rnd1l4, rndl15; 


input [63:0] crc_sigs; 
input [13:0] crosscode; 
input [7:0] min_fit; 

input CLR, VALID, ITER; 


input CLK /* synthesis syn_noclockbuf=1 syn_maxfan=100000 */ ; 
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output [13:0] w_out0, w_outl, w_out2, w_out3, w_out4, w_out5, w_out6, 
w_out7, w_out8, w_out9, w_out10, w_outll, w_out12, w_out13, w_out14, w_out15; 

wire [31:0] crc_adr; 

wire [31:0] wcerc_adr; 

wire [13:0] w_out0O, w_outl, w_out2, w_out3, w_out4, w_out5, w_outé6, 
w_out7, w_out8, w_out9, w_out10, w_outll, w_out12, w_out13, w_out14, w_out15; 

wire [wid_TT+wid_fit-1:0] w—new0, w_newl, w_new2, w_new3, w_new4, w_new5, 
w_new6, w_new7, w_new8, w_new9, w_newl0, w_newll, w_newl2, w_newl3, w_newl4, w_newl15; 

wire [wid_TT+wid_fit-1:0] w_sort0, w_sortl, w_sort2, w_sort3, w_sort4, 
w_sort5, w_sort6, w_sort7, w_sort8, w_sort9, w_sortl0, w_sortll, w_sort12, w_sort13, 
w_sort14, w_sort15; 

//crc_sigs has the format of 32:1 CRC input value, 0 control line 

//cre adr has the format of 8:1 adr lines, 0 control line 

//T£ the control line is 1, the ROMs select the CRC address 

CRC_cale crce(crce_sigs[32:1], wcerc_adr) ; 

defparam ff_crc.n=32; 

dff_eNB ff_crc(wcrc_adr, VALID&ITER, CLK, crc_adr)j; 

defparam ff_crc_ctrl.n=1; 

dff_eNB ff_crc_ctrl(crc_sigs[0], VALID&ITER, CLK, werc_ctrl); 

assign tl = w_new0; 

assign t2 = min_fit; 

//strgen(prev0O, prevl, prev2, prev3, prev4, prev5, prev6, prev7, prevs, 
prev9, prevl0, prevll, prevl2, prevl1l3, prevl4, prev1l5, reset_val, 

// CLR, VALID, ITER, CLK, 

// r_nexto, r_nextl, r_next2, r_next3, r_next4, 
r_next5, r_next6, r_next7, r_nexts8, r_next9, r_next10,r_nextl1l, r_next12, r_next13, 
r_next14, r_next15); 

strgen sg(rnd0Q, rndl, rnd2, rnd3, rnd4, rnd5, rnd6é, rnd7, rnd8, rnd9, 
rnd10, rndll, rnd1l2, rnd13, rnd14, rndl15, 


w_out0[13:0], 
w_out5[13:0], 
w_out11[13:0], 


w_out3[13:0], 
w_out9[13:0], 
w_out15[13:0], 


w_out4[13:0], 
w_out10[13:0], 


w_out6[13:0], 
w_out12[13:0], 


w_out2[13:0], 
w_out8[13:0], 
w_out14[13:0], 


w_out1[13:0], 
w_out7[13:0], 
w_out13[13:0], 


min_fit, CLR, VALID, ITER, CLK, 
w_new0, w_newl, w_new2, w_new3, w_new4, w_new5, w_new6, 
w_new7, w_new8, w_new9, w_newl0, w_newll, w_newl2, w_newl3, w_newl4, w_newl15); 








defparam s.wid_TT=wid_TT; 


defparam s.wid_fit=wid_fit; 


175 


sort s(w_new0, w_newl, w_new2, w_new3, w_new4, w_new5, w_new6, w_new7, 
w_new8, w_new9, w_newl0, w_newll, w_newl2, w_newl3, w_newl4, w_newl5, CLK, 





w_sorto, w_sortl, w_sort2, w_sort3, w_sort4, w_sorts5, 
w_sort6, w_sort7, w_sort8, w_sort9, w_sortl10, w_sortll, w_sortl12, w_sort13, w_sortl14, 
w_sortl15); 


defparam c.wid_TT=wid_TT; 

defparam c.wid_fit=wid_fit; 

crossmut c(w_sort0, w_sortl, w_sort2, w_sort3, w_sort4, 
w_sort5, w_sort6, w_sort7, w_sort8, w_sort9, 
w_sortl10, w_sortll, w_sort12, w_sort13, 


w_sortl14, w_sortl15, 


{crc_adr[7:0], werc_ctrl}, crosscode, CLR, VALID, 
ITER, CLK, 

w_out0O[wid_TT-1:0], w_outl[wid_TT-1:0], 
w_out2[wid_TT-1:0], w_out3[wid_TT-1:0], w_out4 [wid_TT-1:0], w_out5[wid_TT-1:0], 
w_out6[wid_TT-1:0], w_out7[wid_TT-1:0], 

w_out8[wid_TT-1:0], w_out9[wid_TT-1:0], 
w_outl10[wid_TT-1:0], w_out1l1[wid_TT-1:0], w_outl12[wid_TT-1:0], w_out13[wid_TT-1:0], 


w_out14[wid_TT-1:0], w_out15[wid_TT-1:0]); 


endmodule 
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